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CHEMICALLY MODIFIED PROTEINS WITH A CARBOHYDRATE MOIETY 

CROSS-REFERENCE TO RELATED APPLICATIONS 

This application claims the benefit of U.S. Provisional Patent Application 
Serial No. 60/09 L687. filed July 2, 1998, and U.S. Provisional Patent Application Serial 
5 No. 60/1 3 1 ,446, filed April 28, 1 999, and which are hereby incorporated by reference. 

FIELD OF THE INVENTION 

The present invention relates to chemically modified mutant proteins 
having modified glycosylation patterns with respect to a precursor protein from which 

0 they are derived. In particular, the present invention relates to a chemically modified 

mutant protein including a cysteine residue substituted for a residue other than cysteine in 
a precursor protein, the substituted cysteine residue being subsequently modified by 
reacting the cysteine residue with a glycosylated thiosulfonate. The present invention 
also relates to a method of producing the chemically modified mutant proteins and a 

5 glycosylated methanethiosulfonate. Another aspect of the present invention is a method 
of modifying the functional characteristics of a protein by reacting the protein with a 
glycosylated methanethiosulfonate reagent. The present invention also relates to methods 
of determining the structure-function relationships of chemically modified mutant 
proteins, 

0 

BACKGROUND OF THE INVENTION 

Modifying enzyme properties by site-directed mutagenesis has been 
limited to natural amino acid replacements, although molecular biological strategies for 
overcoming this restriction have recently been derived (Cornish et al., Angew. Chem. , Int. 
5 Ed. Engl., 34:621-633 (1995)). However, the latter procedures are difficult to apply in 
most laboratories. In contrast, controlled chemical modification of enzymes offers broad 
potential for facile and flexible modification of enzyme structure, thereby opening up 
extensive possibilities for controlled tailoring of enzyme specificity. 
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Changing enzyme properiies by chemical modification has been explored 
previously, with the first report being in 1966 by the groups of Bender (Polgar et al.. 1 
Am. Chem. Soc . 88:31 53-3 1 54 ( 1 966)) and Koshland (Neet et al.. Proc. Natl. Acad. Sci. 
USA. 56:1606-161 1 (1966)). who created a thiolsubtilisin by chemical transformation 
(CH,OH CH,SH) of the active site serine residue of subtilisin BPN" to cysteine. 
Interest in chemically produced artificial enzymes, including some with synthetic 
potential, was renewed by Wu (Wu et al.. .1 Am. Chem. Soc. 1 1 1 :45 1 4-45 1 5 ( 1 989): Bell 
et al.. Biochemistry . 32:3754-3762 ( 1 993)) and Peterson (Peterson et al.. Biochemistry . 
34:6616-6620 (1995)). and. more recently. Suckling (Suckling el al., Bioora. Med. Chem. 

LetL, 3:531-534 (1993)). 

Enzymes are now widely accepted as useful catalysts in organic synthesis. 
However, natural, wild-type, enzymes can never hope to accept all structures of synthetic 
chemical interest, nor always be transformed stereospecifically into the desired 
enantiomerically pure materials needed for synthesis. This potential limitation on the 
synthetic applicabilities of enzymes has been recognized, and some progress has been 
made in altering their specificities in a controlled manner using the site-directed and 
random mutagenesis techniques of protein engineering. However, modifying enzyme 
properties by protein engineering is limited to making natural amino acid replacements, 
and molecular biological methods devised to overcome this restriction are not readily 
amenable to routine application or large scale synthesis. The generation of new 
specificities or activities obtained by chemical modification of enzymes has intrigued 
chemists for many years and continues to do so. 

U.S. Patent No. 5,208,158 to Bech et al. ("Bech") describes chemically 
modified detergent enzymes where one or more methionines have been mutated into 
cysteines. The cysteines are subsequently modified in order to confer upon the enzyme 
improved stability towards oxidative agents. The claimed chemical modification is the 
replacement of the thiol hydrogen with C,.e,alkyl. 

Although Bech has described altering the oxidative stability of an enzyme 
through mutagenesis and chemical modification, it would also be desirable to develop one 
or more enzymes with altered properties such as activity, nucleophile specificity, 
substrate specificity, stereoselectivity, thermal stability, pH activity profile, and surface 
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binding properties for use in. for example, detergents or organic synthesis. In particular, 
enzymes, such as subtihsins, tailored for peptide synthesis would be desirable. Enzymes 
useful for peptide synthesis have high esterase and low amidase activities. Generally, 
subtilisins do not meei these requirements and the improvement of the esterase to amidase 
selectiviiics of subtilisins would he desirable. However, previous attempts to tailor 
enzymes for peptide synthesis by lowering amidase activity have generally resulted in 
dramatic decreases in both esterase and amidase activities. Previous strategies for 
lowering the amidase activity include the use of water-miscible organic solvents (Barbas 
et aL, J. Am. Chem. Soc , 110:51 62-5 1 66 ( 1 988); Wong et ah. J. Am. Chem. Soc . , 
1 12:945-953 (1990): and Sears et al., Biotechnol. Prog. . 12:423-433 (1996)) and site- 
directed mutagenesis (Abrahamsen et al.. Biochemistry . 30:4151-4159 (1991); Bonneau 
et ah, ''Alteration of the Specificity of Subtilisin BPN' by Site-Directed Mutagenesis in its 
SI andSr Binding-Sites." J. Am. Chem. Soc . 113:1026-1030(1991); and Graycar et al.. 
Ann. N. Y. Acad. Sci. . 67:71-79 (1992)). However, while the ratios of esterase-to- 
amidase activities were improved by these approaches, the absolute esterase activities 
were lowered concomitantly. Abrahamsen et al.. Biochemistry . 30:4151-4159 (1991). 
Chemical modification techniques (Neet et ah, Proc. Nat. Acad. Sci. , 56:1606 (1966): 
Polgar et ah, J. Am. Chem. Soc . 88:3153-3154 (1966); Wu et al.. J. Am. Chem. Soc . 
111:451 4-45 15 (1 989); and West el aL, J. Am. Chem. Soc . 112:531 3-5320 ( 1 990)), 
which permit the incorporation of unnatural amino acid moieties, have also been applied 
to improve esterase to amidase selectivity of subtilisins. For example, chemical 
conversion of the catalytic triad serine (Ser221) of subtilisin to cysteine (Neet et al., Proc 
Nat. Acad. Sci. . 56: 1606 (1966); Polgar et al., J. Am. Chem. Soc , 88:3153-3154 (1966); 
and Nakatsuka et al., J. Am. Chem. Soc . 109:3808-3810 (1987)) or to selenocysteine 
(Wu et al., J. Am. Chem. Soc . 1 1 1 :4514-4515 (1989)), and methylation of the catalytic 
triad histidine (His57) of chymotrypsin (West et al., J. Am. Chem. Soc , 1 12:5313-5320 
( 1 990)), effected substantial improvement in esterase-to-amidase selectivities. 
Unfortunately however, these modifications were again accompanied by 50- to 1000-fold 
decreases in absolute esterase activity. 

Surface glycoproteins act as markers in cell-ceil communication events 
that determine microbial virulence (Sharon et al., Essavs Biochem. , 30:59-75 (1995)), 
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intlammaiion (Lasky. Annu. Rev. Biochem. . 64: 11 3-1 39 ( 1 995): Weis et al.. Anna. Rev. 
Biochem. . 65:441 -473 (1996)). and host immune responses (Varki. Glycobiol. . 3:97-130 

(1993) : Dwek. Chem. Rev. . 96:683-720 ( 1996)). In addition, the correct glycosylaiion of 
proteins is critical to their expression and folding ( Helenius. N4ol. Biol. Cell. 5:253-265 

( 1 994) ) and increases their thermal and proteolytic stability (Opdenakker et al.. FASEB.I. - 
71 ■^^'^0-1337 ( 1993)). Glvcoproteins occur naturally in a number of forms (glycolorms) 
(Rademacher et al.. Annu. Rev. Biochem. . 57:785-838 (1988)) that possess the same 
peptide backbone, but differ in both the nature and site of glycosylation. The differences 
exhibited (Rademacher et al., Annu. Rev. Biochem. . 57:785-838 (1988): Parekh et al.. 
Biochem. . 28:7670-7679 (1989); Knight, Biotechnol. . 7:35-40 (1989)) by each 
component within these microheterogeneous mixtures present regulatory difficulties (Liu. 
Trends Biotechnol. . 10: 11 4- 120 (1992); Bill et al.. Chem. Biol.. 3:145-149 (1996)) and 
problems in determining exact function. To explore these key properties, there is a 
pressing need for methods that will not only allow thd preparation of pure glycosylated 
proteins, but will also allow the preparation of non-natural variants for the determination 
of structure-function relationships, such as structure-activity relationships (SARs). The 
few studies that have compared single glycoforms successfully have required abundant 
sources and extensive chromatographic separation (Rudd et al., Biochem. . 33:1 7-22 
(1994)). Neoglycoproteins (Krantzet al.. Biochem. . 15:3963-3968 (1976)), formed via 
unnatural linkages between sugars and proteins, provide an invaluable alternative source 
of carbohydrate-protein conjugates (For reviews see Stowell et al.. Adv. Carbohvdr. 
Chem. Biochem. . 37:225-281 (1980); Neoglvcocon ju pates: Preparation and Applications , 
Lee et al.. Eds.. Academic Press, London (1994): Abelson et al.. Methods EnzvmoL , 242: 
(1994): Lee et al.. Methods Enzvmol. . 247: (1994): Bovin et al., Chem. Soc. Rev.. 
24:413-421 (1995)). In particular, chemical glycosylation allows control of the glycan 
structure and the nature of the sugar-protein bond. However, despite these advantages, 
existing methods for their preparation (Stowell et al.. Adv. Carbohvdr. Chem. Biochem. . 
37:225-281 (1 980)) typically generate mixtures. In addition, these techniques may alter 
the overall charge of the protein (Lemieux et al.. .1. Am. Chem. Soc, 97:4076-4083 
(1975): Kobayashi et al.. Methods Enzvmoi. , 247:409-41 8 (1994)) or destroy the cyclic 
nature of glycans introduced (Gray, Arch. Biochem . Biophys., 163:426-428 (1974)). For 
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example, the reductive amination of lactose with bovine serum albumin (BSA) caused 
indiscriminate modificaiion of lysine residues through the formation of acyclic amines 
introduced (Gray, Arch. Biochem. Biophvs. . 163:426-428 ( 1974)). Advances in the site- 
specific glycosylation of BSA have been made (Davis et al., Tetrahedron Lett. , 32:6793- 
5 6796 ( 1 99 1 ): Wong et aL, Biochem. J. . 300:843-850 ( 1 994): Macindoe el al., J. Chem. 
Soc. Chem. Commun. . 847-848 (1998)). However, these methods rely upon modification 
of an existing cysteine in BSA and. as such, allow no flexibility in the choice of 
glvcosylaiion site. Glycoproteins occur naturally as complex mixtures of differently 
glycosylated forms which are difficult to separate. To explore their properties, there is a 
0 need for homogenous sources of carbohydrate-protein conjugates. Existing methods 
typically generate product protein mixtures of poorly characterized composition, with 
little or no control over the site or level of glycosylation. 

The present invention is directed to overcoming these deficiencies. ■ 

SUMMARY OF THE INVENTION 

It is an object of the present inventioji to provide for novel glycosy lated 

proteins. 

It is a further object of the invention to provide for novel glycosylated 
proteins which have modified or improved functional characteristics. 

It is a funher object of the invention to provide for a method'of producing 
glycosylated proteins which have well defined properties, for example, by having 
predetermined glycosylation patterns. 

According to the present invention, a method is provided wherein the 
glycosylation pattern of a protein is modified in a predictable and repeatable manner. 
Generally, the modification of the protein occurs via reaction of a cysteine residue in the 
protein with a glycosylated thiosulfonate. 

Thus, in one composition aspect of the present invention, a chemically 
modified mutant protein is provided, wherein said mutant protein differs from a precursor 
protein by virtue of having a cysteine residue substituted for a residue other than cysteine 
in said precursor protein, the substituted cysteine residue being subsequently modified by 
reacting said cysteine residue with a glycosylated thiosulfonate. Preferably, the 
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glycosylated thiosulfonaie is an alkylthiosuUonate. most preferably a 
melhanethiosulfonate. 

In a method aspect of the present invention, a method of producing a 
chemicall>' modified mutant protein is provided comprising the steps of: (a) providing a 

5 precursor protein; (b) substituting an amino acid residue other than cysteine in said 
precursor protein with a cysteine: (c) reacting said substituted cysteine with a 
glycosylated thiosulfonate. said glycosylated ihiosulfonate comprising a carbohydrate 
moiety: and (d) obtaining a modified glycosylated protein wherein said substituted 
cysteine comprises a carbohydrate moiety attached thereto. Preferably, the glycosylated 

10 thiosulfonate is an alkylthiosulfonate. most preferably, a methanelhiosuifonate. Also 
preferably, the substitution in said precursor protein is obtained by using recombinant 
DNA techniques by modifying a DNA encoding said precursor protein to comprise DNA 
encoding a cysteine at a desired location within the protein. 

The present invention also relates to novel glycosylated thiosulfonates. In 

15 a preferred embodiment, the glycosylated thiosulfonate is a methanethiosulfonate. In a 
most preferred embodiment, the glycosylated methanethiosulfonate comprises a chemical 
structure including: 

O 

II 

HoC— S— SR 

II 

O 

where R comprises -|3-Glc, -Et-p-Gal. -Et-P-Glc, -Et-a-Glc, -Et-a-Man, -Et-Lac, 
20 -p-Glc(Ac),, -p-Glc(Ac);„ -P-Glc(Ac),, -Et-a-Glc(Ac),. -Et-a-Glc(Ac),, -Et-a-Glc(Ac),, 
-Et-p-Glc(Ac)„ -Et'P-Glc(Ac),, -Et-P-Glc(Ac),. -Et-a-Man(Ac),, -Et-a-Man(Ac),, 
-Et-P-Gal(Ac),. -Et-p-Gal(Ac),. -Et-Lac{Ac),. -Et-Lac(Ac),, or -Et-Lac(Ac),. 

Another aspect of the present invention is a method of modifying the 
functional characteristics of a protein including reacting the protein with a glycosylated 
25 thiosulfonate reagent under conditions effective to produce a glycoprotein with altered 
functional characteristics as compared to the protein. Accordingly, the present invention 
provides for modified protein, wherein the protein comprises a wholly or partially 
predetermined glycosylation pattern which differs from the glycosylation pattern of the 
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proiein in its precursor, natural, or wild type slate and a method for producing such a 
modified protein. 

The present invention also relates to methods of determining the structure- 
function relationships of chemically modified mutant proteins. One method includes 

5 providing first and second chemically modified mutant proteins of the present invention, 
wherein the glycosylation pattern of the second chemically modified mutant protein 
differs from the glycosylation pattern of the first chemically modified mutant protein, 
evaluating a functional characteristic of the first and second chemically modified mutant 
proteins, and correlating the functional characteristic of the first and second chemically 

10 modified mutant proteins with the structures of the first and second chemically modified 
mutant proteins. Another method involves providing first and second chemically 
modified mutant proteins of the present invention, wherein at least one different cysteine 
residue in the second chemically modified mutant protein is modified by reacting^said 
cysteine residue with a glycosylated thiosulfonate, evaluating a functional characteristic 

15 of the first and second chemically modified mutant proteins, and correlating the 

functional characteristic of the first and second cherpically modified mutant proteins with 
the structures of the first and second chemically modified mutant proteins. 

The chemically modified mutant proteins of the present invention provide 
an alternative to site-directed mutagenesis and chemical modification for introducing 

20 unnatural amino acids into proteins. Moreover, the methods of the present invention 
allow the preparation of pure glycoproteins (i.e., not mixtures) with predetemiined and 
unique structures. These glycoproteins can then be used to determine structure-function 
relationships (e.g.. structure-activity relationships ("SARs")) of non-natural variants of 
the proteins. 

25 An advantage of the present invention is that it is possible to introduce 

predetermined glycosylation patterns into proteins in a simple and repeatable manner. 
This advantage provides an ability to modify critical protein characteristics such as 
partitioning, solubility, cell-cell signaling, catalytic activity, biological activity and 
pharmacological activity. Additionally, the methods of the present invention provide for 

30 a mechanism of "masking'' certain chemically or biologically important protein sites, for 
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example, sites which are critical for immunological or allergenic response or sites which 
are critical to proteolytic degradation of the modified protein. 

Another advantage of the present invention is the ability to glycosylate a 
protein which is not generally glycosylated, or to modify the glycosylation pattern of a 
5 protein which is generally glycosylated. 

Another advantage of the present invention is to produce enzymes which 
have altered catalytic activity. In one specific example, the inventors herein have shown 
that it is possible to modify the substrate specificity of a protease to increase the esterase 
activity as compared to the amidase activity. Similarly, modifications of substrate 
10 specificity would be expected when utiHzing the present invention with other enzymes. 

These and other advantages of the present invention are described in more 
detail in the following detailed description. 

BRIEF DESCRIPTION OF THE DRAWINGS 

15 Figure 1 shows dendrimer methanethiosulfonate ("MTS") reagents. 

Figure 2 shows the synthesis of a first generation glycodendrimer reagent 
which bears two D-mannose units on its termini and has one arm as a MTS which can be 
attached to a subtilisin Bacillus lentus cysteine mutant. 

Figure 3 shows the synthesis of highly-functionalized glycodendrimer- 
20 protein conjugates. 

Figure 4 shows two parallel synthetic approaches to modification of 
subtilisin Bacillus lentus with dendrimers. Both approaches allow the use of a large 
library of methanethiosulfonate reagents (R-SSO.IVIe) to cap the dendrimeric branches. 
The routes shown allow for the preparation of both dimeric and trimeric dendrimers 
25 Figure 5 shows peptide coupling catalyzed by an enzyme. 

Figure 6 shows the preparation of two types of glycosylating reagents from 
D-glucose (2a): the anomeric methanethiosulfonate la and the ethyl -tethered 
methanethiosulfonates lb, c, g, h. 

Figure 7 shows the preparation of the a-D-manno-MTS reagents Id and li, 
30 which are epimeric at C-2 relative to lb and Ig, respectively, and the p-D-galacto-MTS 
reagents le and Ij, epimeric at C-4 relative to Ic and Ih, respectively. 
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Figure 8 shows the reaction of the thiol residue of cysteine introduced into 
subtihsin Bacillus lenius with glycomethanethiosulfonate reagents. 

Figure 9 shows the reaciion of 2.3.4. 6-tetra-O-acetyl-P-D-gUicopyranosyl 
niethaneihiosulfonaie (la) with subiilisin Bacillus lenfiis-N62C. -S156C. -S166C, - 
L217C. 

Figures lOA-D show deproiecied glycan structure-proieolytic activity 
SARs of subtihsin Bacillus lenius cysteine mutants and glycosylated chemically modified 
mutant enzymes ("CMMs") relative to wild-type C'WT''). A break in the axis indicates 
that the value was not determined. At position 62, glycosylation partially restores the 
decrease in Av,,/A'\/ caused by mutation to cysteine (R = H) (Figure lOA). At position 217, 
the 4-fold decrease in activity caused by mutation (R = H) is amplified to 6-fold lower 
than WT by glycosylation with untethered S-P-Glc, but reduced to around 2.5-fold lower 
than WT by glycosylation with ethyl-tethered glycans (b-f) (Figure lOB). At position 
156. an arced variation in activity reaches a 3-fold lower than WT minimum k^JK^^faX 
bulky lacto-CMM SI 56C-S-f (Figure IOC). At position 166, the 2.5-fold decrease in 
/:^.„/A'^v/ caused by mutation is amplified by glycosylation. A^.^y/T^^/ decreases monotonically 
from S166C (R = H) to a value that is 3.8-fold lower than WT for SI 66C-S-f (Figure 
lOD). 

Figures 1 1 A-D show the acetylated glycan structure-proteolytic activity 
SARs of glycosylated chemically modified mutant enzymes relative to WT. For each 
glycan the number of acetate groups present is indicated by a label on the corresponding 
bar. A break in the axis indicates that the value was not determined. At positions 62 
(Figure 1 1 A), 217 (Figure 1 IB), and 166 (Figure 1 ID) an alternating trend in activity is 
observed as a result of the opposite effects of acetylation upon k^JKj^, according to 
anomeric stereochemistry (see Figure 12). This results in a k^JK^^f for N62C-S-g that is 
1.1 -fold higher than WT. At position 156 (Figure 1 IC), variations are slight and this is 
consistent with its surface exposed orientation. 

Figures 12A-D show the variation in proteolytic activity of glycosylated 
chemically modified mutant enzymes of subtihsin Bacillus lenfus upon acetylation of 
gl yeans. Comparison of the activity of acetylated with fully deprotected chemically 
modified mutant enzymes shows that at positions 62 (Figure 12A) and 217 (Figure 12B) 
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acetylation enhances the activity of a-tethered chemically modified mutant enzymes but 
decreases that of p-tethered. In contrast, at position 166 (Figure 12D). acetylation 
decreases the activity of a-iethered CMMs but increases that of p-tethered. Consistem 
with its surface exposed orientation, changes at position 156 (Figure 12C) are modest. 
For each glycan the number of acetate groups present is indicated by a label on the 
corresponding bar. A break in the axis indicates that the value was not determined. 

Figures 13.a\-D show esterase kJK,,s for deprotected glyco-CMMs 
relative to WT. At position 62 (Figure 13A). in the S, pocket, glycosylaiion leads to a 
series of enzymes that have similar activities that are 1 .3- to 1 .9-fold greater than WT. 
At position 217 (Figure 13B). in the S,' pocket, glycosylaiion also increases kJK,,. to a 
maximum 3.5-fold greater than WT for L217C-SEtGa! (-e). At position 156 (Figure 
13C), in the S, pocket, glycosylation leads to a reduction in kJK„. At position 166 
(Figure 13D), in the S, pocket, the dramatic loss of activity upon mutation to cysteine (R 
= H) is restored by glycosylation. All Five S166C deprotected glyco-CMMs have similar 
5 k^ JK^s that are 1 . 1 - to 1 .4-fold lower than WT. 

Figures 14A-D show the effect of acetylation on kJK„oi glyco-CMMs. 
For each glycan the number of acetate groups present is indicated by a label on the 
corresponding bar. A break in the axis indicates that the value was not determined. At 
positions 62 (Figure 14A). in the S, pocket, and 21 7 (Figure 14B), in the S,' pocket, the 
20 effect ofacetylation is dependent on anomeric stereochemistry. At both sites, acetylation 
of a-linked sugars (-b,-d) leads to an increase in kJK„. whereas kJK,, is decreased for 
' p-linked sugars (-c.-e,-f). kJK,, &\so increases as the number of acetates increases. 
Consistent with the surface-exposed nature of its side chain, acetylation at position 1 56 
(Figure 14C), in the S, pocket, has very little effect on kJK,, At position 166 (Figure 
25 1 4D). in the S , pocket, the effects of acetylation are opposite to those observed at 

positions 62 and 217. Acetylation increases kJK,, of P-linked glyco-CMMs (-c,-e), 
while causing a decrease for the a-linked glyco-CMMs (-b,-d). 

Figures 1 5A-D show the E/A of deprotected glyco-CMMs relative to WT. 
A break in the axis indicates that the value was not determined. Glycosylation with 
30 deprotected reagents 1 b-f increases the E/A ratio in all cases. The greatest effects are 

observed at positions 62. in the S, pocket (Figure 1 5 A), and 2 1 7, in the S,' pocket (Figure 
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15B). where the largest increases in E/A result in values up to 10-9'fold greater than VVT. 
At both sites the E/A is higher for the p-linked glyco-CMMs than the a-linked ones. At 
positions 156 (Figure 15C) and 166 (Figure 15D). in the Sj pocket, there is little variation 
in E/A. 

5 Figures 16A-D show the effect of introducing aceiylated glycans on E/A. 

For each glycan the number of acetate groups present is indicated by a label on the 
corresponding bar. A break in the axis indicates that the value was not determined- 
Values greater than zero indicate acetylalion increases E/A, negative values denote a 
reduction in E/A upon acetylalion. 

10 Figure 17 shows the modeling of the high esterase activity of L217C-S- 

Glc(Ac)3. A minimized structure of L21 7C-S-Glc(Ac), of the active site of SBL showing 
the catalytic residues Ser22K His64. The carbon atoms of the triacetyated D-glucose 
moiety, which is bound to Leu217 via a disulfide bond, are numbered. The phenyl ring of 
product AAPF occupies the Sj binding site and forms a crucial hydrogen bond (k.72 A) 

15 to Wat 127. This water molecule is further stabilized by a second hydrogen bond (1 .89 A) 
to the carbonyl O of the C-2 acetate group of glucose. 

Figure 18 shows the proposed acyl-enzyme intermediate of L217C-S- 
Glc(Ac),. The carboxy terminus of AAPF forms a bond to the O.^ atom of Ser22F. 
Wat 127 acting as the crucial deacylating nucleophilic water molecule, is stabilized 

20 through its hydrogen bond to the carbonyl group of the C-2 acetate group. 

DETAILED DESCRIPTION OF THE INVENTION 

According to the present invention, a method is provided wherein the 
25 glycosylalion pattern of a protein is modified in a predictable and repeatable manner. 

Generally, the modification of the protein occurs via reaction of a cysteine residue in the 
protein with a glycosylated thiosulfonate. 

Thus, in one composition aspect of the present invention, a chemically 
modified mutant protein is provided, wherein said mutant protein differs from a precursor 
30 protein by virtue of having a cysteine residue substituted for a residue other than cysteine 
in said precursor protein, the substituted cysteine residue being subsequently modified by 
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reacting said cysteine residue with a glycosylated thiosulfonate. Preferably, the 
glycosylated thiosulfonate is an alkylthiosulfonate. most preferably, a 
methanethiosulfonate. 

In a method aspect of the present invention, a method of producing a 
chemically modified mutant protein is provided comprising the steps of: (a) providing a 
precursor protein: (b) substituting an ammo acid residue other than cysteine in said 
precursor protein with a cysteine: (c) reacting said substituted cysteine with a 
glycosylated thiosulfonate. said glycosylated thiosulfonate comprising a carbohydrate 
moiety: and (d) obtaining a modified glycosylated protein wherein said substituted 
cysteine comprises a carbohydrate moiety attached thereto. Preferably, the glycosylated 
thiosulfonate is an alkylihiosulfonate, most preferably, a methanethiosulfonate. Also 
preferably, the substitution in said precursor protein is obtained by using recombinant 
DNA techniques by modifying a DNA encoding said precursor protein to comprise DNA 
encoding a cysteine at a desired location within the protein. The amino acid residues to 
be substituted with cysteine residues according to the present invention may be replaced 
using site-directed mutagenesis methods or other methods well known in the art. See, for 
example, PCT Publication No. WO 95/10615, which is hereby incorporated by reference. 

The present invention also relates to a glycosylated thiosulfonate. 
Preferably, the glycosylated thiosulfonate comprises methanethiosulfonate. More 
preferably, the methanethiosulfonate comprises the chemical structure: 

O 
II 

H3C— S— SR 

II 

O 

where R comprises -P-Glc, -Et-p-Gal, -Et-p-Glc, -Et-a-Glc, -Et-a-Man. -Et-Lac, 
-P-Glc(Ac),, -p-Glc(Ac),, -p-Glc(Ac),, -Et-a-Glc(Ac)„ -Et-a-Glc(Ac).. -Et-a-Glc(Ac)„ 
-Et-P-Glc(Ac)„ -Et-p-Glc(Ac)„ -Et-p-Glc(Ac),, -Et-a-Man(Ac),, -Et-a-Man(Ac)4. 
-Et-p-Gal(Ac)3, -Et-P-Gal(Ac)4, -Et-Lac(Ac)3, -Et-Lac(Ac),, or -Et-LacCAc)^. 

Another aspect of the present invention is a method of modifying the 
functional characteristics of a protein including providing a protein and reacting the 
protein with a glycosylated thiosulfonate reagent under conditions effective to produce a 
glycoprotein with altered functional characteristics as compared to the protein. 
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Accordingly, the present invention provides for modified protein, wherein the protein 
comprises a wholly or partially predetermined glycosylation pattern which differs from 
the glycosylation pattern of the protein in its precursor, natural, or wild type state and a 
method for producing such a modified protein. As used herein, glycosylation pattern 
5 means the composition of a carbohydrate moiety. The present invention also relates to 
methods of determining the structure-function relationships of chemically modified 
mutant proteins. The first method includes providing first and second chemically 
modified mutant proteins of the present invention, wherein the glycosylation pattern of 
the second chemically modified mutant protein differs from the glycosylation pattern of 

10 the first chemically modified mutant protein, evaluating a functional characteristic of the 
first and second chemically modified mutant proteins, and correlating the functional 
characteristic of the first and second chemically modified mutant proteins with the 
structures of the first and second chemically modified mutant proteins. The second 
method involves providing first and second chemically modified mutant proteins of the 

15 present invention, wherein at least one different cysteine residue in the second chemically 
modified mutant protein is modified by reacting said cysteine residue with a glycosylated 
thiosulfonate, evaluating a functional characteristic of the first and second chemically 
modified mutant proteins, and correlating the functional characteristic of the first and 
second chemically modified mutant proteins with the structures of the first and second 

20 chemically modified mutant proteins. 

The chemically modified mutant proteins of the present invention provide 
a valuable source of carbohydrate-protein conjugates. Moreover, the methods of the 
present invention allow the preparation of pure and glycoproteins (i.e., not mixtures) with 
predetermined and unique structures. These glycoproteins can then be used to determine 

25 structure-function relationships (e.g., structure-activity relationships ("SARs")) of non- 
natural variants of the proteins. 

The protein of the invention may be any protein for which a modification 
of the glycosylation pattern thereof may be desirable. For example, proteins which are 
naturally not glycosylated may be glycosylated via the invention. Similarly, proteins 

30 which exist in a naturally glycosylated form may be modified so that the glycosylation 
pattern confers improved or desirable properties to the protein. Specifically, proteins 
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useful in the present invention are those in which glycosylation plays a role in functional 
characteristics such as, for example, biological activity, chemical activity, 
pharmacological activity, or immunological activity. 

Glycosylated proteins as referred to herein means moieties having 
5 carbohydrate components which are present on proteins, peptides, or amino acids. In the 
present invention, the glycosylation is provided, for example, as a result of reaction of the 
glycosylated thiosulfonate with the thiol hydrogen of a cysteine residue thereby producing 
an amino acid residue which has bound thereto the carbohydrate component present on 
the glycosylated thiosulfonate. 
10 Another aspect of the present invention is a method of modifying the 

functional characteristics of a protein including providing a protein and reactmg the 
protein with a glycosylated thiosulfonate reagent under conditions effective to produce a 
glycoprotein with altered functional characteristics as compared to the protein. 

The functional characteristics of a protein v^^hich may be altered by the 
15 present invention include, but are not limited to, enzymatic activity, the effect on a human 
or animal body, the ability to act as a vaccine, the tertiary structure (i.e., how the protein 
folds), whether it is allergenic, its solubility, its signaling effects, its biological activity, 
and its pharmacological activity (Paulson, "Glycoproteins: What are the Sugar Chains 
For?", Trends in Biochem. Sciences , 14:272-276 (1989), which is hereby incorporated by 
20 reference). The use of glycosylated thiosulfonates as thiol-specific modifying reagents in 
the method of the present invention allows virtually unlimited alterations of protein 
residues. In addition, this method allows the production of pure glycoproteins with 
predetermined and unique structures and, therefore, unique functional characteristics, with 
control over both the site and level of glycosylation. In particular, the method of 
25 modifying the functional characteristics of a protein allows the preparation of single 
glycoforms through regio- and glycan-specific protein glycosylation at predetermined 
sites. Such advantages provide an array of options with respect to modification of protein 
properties which did not exist in the prior art. The ability to produce proteins having very 
specific and predictable glycosylation patterns will enable the production of proteins 
30 which have known and quantifiable effects in chemical, pharmaceutical, immunological, 
or catalytic performance. For example, with knowledge of a specific problematic epitope, 
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ii would be possible to construct a modified protein according to the present invention in 
which the epitope is masked by a carbohydrate moiety, thus reducing its allergenic or 
immunogenic response in a subject. As another example, where the solubility of a proiem 
is problematic in terms of recover)- or formulation in a pharmaceutical or industrial 

5 application, it may be possible, utilizing the present invention, to produce a protein which 
has altered solubility profiles thus producing a more desirable protein product. As 
another example, if a protein has particular problem of being proteolytically unstable in 
the environment in which it is to be used, then it may be possible to mask the proteolytic 
cleavage sites in the protein using the present invention to cover up such a site with a 

10 carbohydrate moiety . These examples are merely a few of the many appHcations of the 
present invention to produce improved proteins. 

In a preferred embodiment, the protein is an enzyme. The term "enzyme'' 
includes proteins that are capable of catalyzing chemical changes in other substances 
without being changed themselves. The enzymes can be wild-type enzymes or variant 

15 enzymes. Enzymes within the scope of the present invention include pullulanases: 

proteases, cellulases, amylases, isomerases, lipases, oxidases, and reductases. Preferably, 
the enzyme is a protease. The enzyme can be a wild-type or mutant protease. Wild-type 
proteases can be isolated from, for example, Bacillus lentus or Bacillus amyloliquefaciens 
(also referred to as BPN'). Mutant proteases can be made according to the teachings of, 

20 for example, PCT Publication Nos. WO 95/10615 and WO 91/06637, which are hereby 
incorporated by reference. Functional characteristics of enzymes which are suitable for 
modification according to the present invention include, for example, enzymatic activity, 
solubility, partitioning, cell-cell signaling, substrate specificity, substrate binding, 
stability to temperature and reagents, ability to mask an antigenic site, physiological 

25 functions, and pharmaceutical functions (Paulson, "Glycoproteins: What are the Sugar 
Chains For?", Trends in Biochem. Sciences , 14:272-276 (1989), which is hereby 
incorporated by reference). 

The protein is modified so that a non-cysteine residue is substituted with a 
cysteine residue, preferably by recombinant means. Preferably, the amino acids replaced 

30 in the protein by cysteines are selected from the group consisting of asparagine, leucine, 
or serine. 
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The terms ''thiol side chain group," "thiol containing group/* and "thiol 
side chain'* are terms which are can be used interchangeably and include groups that are 
used to replace the thiol hydrogen of a cysteine used to replace one of the amino acids in a 
protein. Commonly, the thiol side chain group includes a sulfur tlirough which the thiol 
5 side chain groups defined above are attached to the thiol sulfur of the cysteine. 

The glycosylated thiosulfbnates of the invention are those which are 
capable of reacting with a thiol hydrogen of a cysteine to produce a glycosylated amino 
acid residue. By glycosylated is meant that the thiosulfonate has bound thereto a sugar or 
carbohydrate moiety which can be transferred to a protein pursuant to the present 
10 invention. Preferably, the glycosylated thiosulfonates are glycosylated 

alkylthiosulfonates, most preferably, glycosylated methanethiosulfonates. Such 
glycosylated methanethiosulfonate have the general formula: 

O 
II 

H3C— S— SR 
II 

o 

In particularly preferred embodiment, the methanethiosulfonate comprises 
15 an R group which comprises: -p-Glc, -Et-P-Gal. -Et-p-Glc, -Et-a-Glc, -Et-a-Man, 
-Et-Lac, -P-Glc(Ac)„ -p-Glc(Ac)„ -p-Glc(Ac),, -Et-a-Glc(Ac),, -Et-a-Glc(Ac),, 
-Et-a-Glc(Ac),, -Et-p-Glc(Ac)., -Et-p-Glc(Ac);„ -Et-p-Glc(AcX, -Et-a-ManCAc),, 
-Et-a-Man(Ac)„ -Et-p-GaKAc),, -Et-p-GaKAc)^, -Et-Lac(Ac)5, -Et-LacCAc)^, or -Et- 
Lac(Ac)7. 

20 In a preferred embodiment, the carbohydrate moiety of the present 

invention is a dendrimer moiety. Multiple functionalization of chemically modified 
mutant proteins can be achieved by dendrimer approaches, whereby multiple-branched 
linking structures can be employed to create poly-functionalized chemically modified 
mutant proteins. 

25 Highly branched molecules or dendrimers were first synthesized by Vogtle 

in 1978 (Buhleier et al.. Synthesis , 155-158 (1978), which is hereby incorporated by 
reference). The attachment of identical building blocks that contain branching sites to a 
central core may be achieved with a high degree of homogeneity and control. Each 
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branch contains a functional group which, after chemical alteration, may be connected to 
yet another branching building block. In this manner, layer after layer of branching 
rapidly generates highly-funciionalized molecules. 

For instance, multiple glycosylation, including multiple mannose- 
containing chemically modified mutant proteins, and varied sugar moieties, can be 
created. The dendrimer reagent structures would include methanethiosulfonates with 
simple branching such as: 

CH3SO2S , ^ 



derived from pentaerythritol, to very complex branched dendrimer reagents (see Figure 
10 1). 

In particular, a first generation glycodendrimer reagent is synthesized as 
shown in Figure 2. This approach can be extended to cover larger dendrimers. More 
specifically, by leaving one "arm" of the glycodendrimer free for conversion to a 
methanethiosulfonate, the remaining arms can be further branched to synthesize highly- 
15 functionalized glycodendrimer reagents as shown in Figure 3. Through further branching 
and by using different carbohydrates, this concept can be extended to virtually unlimited 
levels. 

A flexible synthetic strategy for the synthesis of core dendrimeric 
methanethiosulfonate building blocks that may be used either in situ or before 
20 modification to construct dendrimers is shown in Figure 4. 

The present invention also relates to glycosylated thiosulfonate 
compositions. Preferably the glycosylated thi ©sulfonates are methanethiosulfonates and 
comprise a chemical structure: 

O 
II 

H3C— S— SR 
II 

O 

25 wherein R comprises -(i-GIc, -Et-P-Gal, *Et-P-Glc, -Et-a-Glc, -Et-a-Man, -Et-Lac, -p- 
Glc(Ac),, -P-Glc(Ac)3, -p-Glc(Ac)4, -Et-a-GlcCAc),, .Et-a-Glc(Ac)3, -Et-a-GlcCAc)^, -Et- 
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p-Glc(Ac).. -Et-p-Glc(Ac),. -Et-p-Glc(Ac),, -Et-a-Man(Ac);„ -Et-a-Man(Ac),, -Et-p- 
GalCAc),. -Et-p-Gal(Ac),. -Et-Lac(Ac),, -Et-Lac(Ac),. or -Et-LacCAc),. 

The present invention also relates to a method of determining the structure- 
function relationships of chemically modified mutant proteins. This method involves 

5 providing first and second chemically modified mutant proteins of the present invention, 
wherein the glycosylation pattern of the second chemically modified mutant protein is 
different from the glycosylation pattern of the first chemically modified mutant protein, 
evaluating a functional characteristic of the first and second chemically modified mutant 
proteins, and correlating the functional characteristic of the first and second chemically 

10 modified mutant proteins with the structures of the first and second chemically modified 
mutant proteins. 

Evaluating a functional characteristic of the first and second chemically 
modified mutant protein includes testing for functional characteristics including, but not 
limited to, stability to temperature and reagents, solubility, partitioning, enzymatic 

15 activity, cell-cell signaling, substrate specificity, substrate binding, ability to mask an 
antigenic site, physiological functions, and pharmaceutical functions (Paulson, 
''Glycoproteins: What are the Sugar Chains For?", Trends in Biochem. Sciences , 14:272- 
276 (1989), which is hereby incorporated by reference). 

Another aspect of the present invention is a second method of determining 

20 -the structure-function relationships of chemically modified mutant proteins. This method 
involves providing first and second chemically modified mutant proteins of the present 
invention, wherein at least one different cysteine residue in the second chemically 
modified mutant protein is modified by reacting said cysteine residue with a glycosylated 
thiosulfonate, evaluating a functional characteristic of the first and second chemically 

25 modified mutant proteins, and correlating the functional characteristic of the first and 
second chemically modified mutant proteins with the structures of the first and second 
chemically modified mutant proteins. 

By way of example to illustrate some of its advantages, the following 
discussion v/ill focus on certain proteases which are modified according to the methods of 

30 the present invention. Alkaline serine proteases (subtilisins) are finding increasing use in 
biocatalysis, particularly in chiral resolution, regioselective acylation of polyfunctional 
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conipounds, peptide coupling, and glycopepiide synthesis. As shown in Figure 5, 
subtilisins can catalyze peptide bond formation starting from an ester substrate, by first 
forming an acyl enzyme intermediate which then reacts with a primary amine to form the 
peptide product. This application requires high esterase activity to promote acyl enzyme 
formation and low amidase activity to minimize hydrolysis of the peptide bond of the 
desired product. Generally, subtilisins do not meet these requirements. However, the 
improvement of the esterase to amidase seleciivities of subtilisins has been a long sought 
after goal. By using the methods provided for in the present invention, it is possible to 
produce subtilisins which have advantageous properties. 

The inventors in the present case used site specific mutagenesis to modify 
certain residues and introduce additional cysteine residues within subtilisin which would 
then serve to react with a glycosylated methanethiosulfonate to produce a glycosylation ^ 
point at the introduced cysteine. Bacillus lemus subtilisin was selected for illustrative 
purposes because it does not contain a natural cysteine and is not naturally glycosylated. 

The substrate binding site of an enzyme consists of a series.of subsites 
across the surface of the enzyme. The portion of substrate that corresponds to the subsites 
are labeled P and the subsites are labeled S. By convention, the subsites are labeled S,, 
S^, S3, S4, S,', and S2'. A discussion of subsites can be found in Berger et aL, Phil. Trans. 
Roy. Soc. Lond. B , 257:249-264 (1970). Siezen et aL, Protein Engineering , 4:719-737 
(1991), and Fersht, Enzyme Structure and Mechanism , 2 ed.. Freeman: New York, 29-30 
(1985), which are hereby incorporated by reference. 

In the present illustration, the S,, S,', or S, subsites were selected as 
suitable targets for modification. In particular, the amino acids corresponding to N62, 
L217, SI 56, and SI 66 in naturally-occurring subtilisin from Bacillus amyloliquefaciens 
or to equivalent amino acid residues in other subtilisins, such as Bacillus lentus subtilisin, 
were selected for modification to cysteine. The mutated subtilisin was produced through 
standard site directed mutagenesis techniques and the obtained mutant subtilisin was 
reacted with certain glycosylated alkylthiosulfonates, particularly glycosylated 
methanethiosulfonates, as provided in the examples appended hereto. 

Enzymatic peptide coupling is an attractive method for preparation of a 
variety of peptides, because this method requires minimal protection of the substrate. 
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proceeds under mild conditions, and does not cause racemization. Wong et al.. Enzvmes 
in Synthetic Organic Chemistry , Pergamon Press: Oxford. 41-130 (1994), which is hereby 
incorporated by reference. In spite of these adyantages, two major problems have limited 
the use of serine proteases in peptide synthesis. One is their efficient proteolytic 
5 (amidase) activity which causes hydrolysis of the coupling product, and the other is their 
stringent structural specificity and stereospecificity. 

Surprisingly, it was found that the chemically modified mutant subtilisins 
of the present invention have altered esierase-to-amidase activity as compared to the 
precursor enzyme. Increasing the esterase-to-amidase ratio enables the use of the enzyme 
10 to more efficiently catalyze peptide synthesis. In particular, subtilisins can catalyze 
peptide bond formation starting from an ester substrate (i.e. an acyl donor), by first 
forming an acyl enzyme intermediate which then reacts with a primary amine (i.e. an acyl 
acceptor) to form the peptide product, as shown in Figure 5. This reaction thus requires 
high esterase activity to promote acyl enzyme formation and. then, low amidase activity 
1 5 to minimize hydrolysis of the peptide bond of the desired product. The chemically 
modified mutant subtilisins produced according to the present invention show an 
increased esterase-to-amidase ratio, without reducing the absolute esterase activity of the 
enzyme. In addition, certain modified enzymes of the present invention even show a 
concomitant increase in the absolute esterase activity. 
20 Therefore, an unexpected benefit of subtilisins which are modified 

according to the present invention is that they can be used in organic synthesis to, for 
example, catalyze a desired reaction and/or favor a certain stereoselectivity. See e.g., 
Noritomi et al. Riotech. Bioeng. 51:95-99 (1996); Dabulis et al. Biotech. Bioeng. 41:566- 
571 (1993), and Fitzpatrick et al .1. Am. Chem. Soc. 113:3166-3171 (1991), which are 
25 hereby incorporated by reference. 

Proteins obtained using the methods provided herein may be used in any 
application in which it is desired to use such proteins, where having modified functional 
capabilities is advantageous. Thus, proteins modified as provided herein may be used in 
the medical field for pharmaceutical compositions and in diagnostic preparations. 
30 Additionally, proteins such as enzymes which are modified according to the present 
invention may be used in applications which are generally known for such enzymes 
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including industrial applications such as cleaning products, textile processing, feed 
modification, food modification, brewing of grain beverages, starch processing, as 
antimicrobials, and in personal care formulations. Moreover, the unique functionalities 
made possible by the present invention may result in uses for proteins which have not 
5 heretofore been recognized as feasible. 

EXAMPLES 

Example 1 -Preparation of Methanethiosulfonate ("MTS") Reagents 

10 The preparation of NaSSOXH^ (Kenyon et al.. Methods Enzvmol. . 

47:407-430 (1977), which is hereby incorporated by reference) has been described 
previously (Berglund et al., J. Am. Chem. Soc . 1 19:5265-5266 (1997), which is hereby 
incorporated by reference). Acetobromoglucose (3) (See Figure 6) (prepared from D- 
glucose according to Scheurer et al., J. Am. Chem. Soc 76:3224 (1954), which is hereby 

15 incorporated by reference) in 73% yield, pentaacetylglucose (prepared from the 

corresponding parent carbohydrates according to the method of Verley et al., Ber. Dtsch. 
Chem. Ges. . 34:3354-3358 (1901), which is hereby incorporated by reference, and 
purified by flash chromatography) in 99% yield, 5d (See Figure 7) (prepared from the 
corresponding parent carbohydrates according to the method of Verley et al., Ber. Dtsch. 

20 Chem. Ges. , 34:3354-3358 (1901), which is hereby incorporated by reference, and 

purified by flash chromatography) in 92% yield, 5e (See Figure 7) (prepared from the 
corresponding parent carbohydrates according to the method of Verley et al., Ber. Dtsch. 
Chem. Ges. , 34:3354-3358 (1901), which is hereby incorporated by reference, and 
purified by flash chromatography) in 99% yield, 5f (See Figure 7) (prepared from lactose 

25 according to the method of Hudson et al., J. Am. Chem. Soc , 37:1270-1275 (1915), 

which is hereby incorporated by reference, and purified by flash chromatography in 82% 
yield) were prepared according to literature methods. N,N-dimethylformamide ("DMF") 
was distilled under N, from CaHj and stored over molecular sieve under N, before use. 
Methanol was distilled from Mg/U under N, immediately prior to use. Br(CH2)20H was 

30 stood over and distilled from CaO under reduced pressure and stored under N, prior to 
use. AM other chemicals were used as received from Sigma-Aldrich (St. Louis, MO) or 
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Baker (Phillipsburg, NJ). All flash chromatography was performed using silica gel 
(Whatman, 60A, 230-400 Mesh, Clifton. NJ). N4elting points were determined using an 
Electrothermal IA9000 series digital melting point apparatus and are uncorrected. IR 
spectra were recorded on Bomem MB or Perkin-Elmer FTIR Spectrum 1000 
spectrophotometers- 'H NMR and ' 'C NMR spectra were recorded on Varian Gemini 
200, Unity 400 or Unity 500 NMR spectrometers at the frequencies indicated. Where 
indicated, NMR peak assignments were made using Correlation Spectroscopy C'COSY") 
or Distoitionless Enhanced by Polarization Transfer ('*'DEPT") experiments, all others are 
subjective. All chemical shifts were referenced to residual solvent as an internal standard; 
for ''C NMR in D^O 1 ,4-dioxan (67.6 ppm) was used. ES-MS data were acquired using a 
PE SCIEX API III Biomolecular mass spectrometer. All HRMS data were acquired using 
Micromass 70-2508 or Micromass ZAB-SE mass spectrometers according to the 
ionization methods indicated. Solvents were removed in vacuo. 

Preparation of 2,3,4, S-Tetra-O-acetyl-^-D-glucopyranosyl methanethiosulfonate (la). 

Initial approaches to untethered glyco-MTS reagents similar in type to la 
(See Figure 8) were based upon Danishefsky's glycal methodology (Halcomb et ah, L 
Am. Chem. Soc , 1 1 1:6661-6666 (1989), which is hereby incorporated by reference). 
Tris-TBS glucal was prepared according to the method of Lesimple et al.. Tetrahedron 
20 . Lett. , 27:6201-6204 (1986), which is hereby incorporated by reference, and oxidized to 
tris-TBS protected 1,2-anhydroglucose using dimethyldioxirane. However, under a 
variety of conditions and in contrast to the behavior of other sulfur nucleophiles (Gordon 
et aL, Carbohvdr. Res. , 206:361-366 (1990); Berkowitz et al, J. Am. Chem. Soc , 
1 14:4518-4529 (1992), which are hereby incorporated by reference), 
25 methanethiosulfonate ion failed to open the epoxide moiety of tris-TBS protected 1,2- 
anhydro-D-glucose. Deprotection of la (See Figure 8) was attempted under a variety of 
conditions, but in all cases led only to decomposition or hydrolysis of the thioglucosidic 
bond (Zemplen et al., Ber. Dtsch. Chem. Ges. , 56:1705-1710 (1923); Plattner et al., L 
Am. Chem. Soc , 94:8613-8615 (1972); Mori et al., Tetrahedron Lett. , 20:1329-1332 
30 ( 1 979); Herzig et al., Carbohvdr. Res. , 1 53 : 1 62- 1 67 ( 1 986); Herzig et al., J. Org. Chem.. 




5 
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51 :727-730 (1986): Vekemans et al., Tetrahedron Lett. . 28:2299-2300 (1987); Cinget et 
al., Svnlett . 168-170 (1993), which are hereby incorporated by reference). 

Aceiobromoglucose (3) (See Figure 6) (1 g, 2.43 mmol) was added to a 
solution of NaSSO^CH-, (380 mg. 2.84 mmol) in ethanoi (4 mL) at 90''C under N,. After 
5 20 minutes the resulting suspension was cooled and the solvent removed. The residue 
was purified by flash chromatography (EtOAc : hexane, 9:1 1) and the resulting solid 
recrystallized from ether to give la (See Figure 8) (674 mg, 63%) as a white solid; mp 
151-1 52°C melts then decomp. (ether); [afo = - 1 9.0 (cl .24, CHCI,); IR (KBr) 1 749 cm" 
' (C=0), 1333, 1 140 cm ' (S-SO,); 'H NMR (400 MHz. CDCI3) 6 2.00, 2.04, 2.06, 2.07 (s 

10 X 4, 3H X 4, Ac X 4). 3.44 (s, 3H, CHjSO,-), 3.82 (ddd, J^ , 10.1 Hz, 5.9 Hz. J,,- 2.2 
Hz, IH, H-5), 4.08 (dd, J. ^, 5.9 Hz, J^ ^,- 12.5 Hz, IH, H-6), 4.3 1 (dd, J, ^- 2.2 Hz, J^.s- 12.5 
Hz, IH, H-61, 5.05 (t, J9.8 Hz, IH, H-4), 5.07 (dd, J,., 10.5 Hz, J,j 9.4 Hz, IH, H-2), 
5.25 (d,y,.2 10.5 Hz,lH, H-1), 5.29 (t, J9.3 Hz, IH, H-3); NMR (50 MHz, CDCI-) 5 
20.5, 20.7 (CH.COO- x 4), 52.8 (CH3SO,-), 61.8, 68.0, 68.7, 73.3, 76.6 (C-2, C-3. C-4, 

15 C-5, C-6), 86.4 (C-1). 169.3, 169.3, 169.7, 170.1 (CH,COO- x 4); HRMS m/z (EI+-): 
Found 443.0636 (M+H'); CijHjjO, ,3, requires 443.0682. 

Preparation of 2-(2, 3. 4. 6-Tetra-0-acetyl-a-D-glucopyranosyl)ethyl methanethiosulfonate 
(JgJ- 

20 BFj.EtjO (145 |iL, 1.1 mmol) was added dropwise to a suspension of D- 

glucose (2a) (See Figure 6) (1 .45 g, 8.1 mmol) in Br(CH3)20H (19 mL) under N, and the 
resulting mixture heated to 105°C. After 8 hours, the resulting solution was cooled and 
the solvent removed. The residue was dissolved in Ac,0/pyridine (2:3 v/v, 16 mL) under 
N,. After a further 24 hours, the reaction solvent was removed and the residue purified by 

25 repeated flash chromatography (EtOAc then EtOAc : hexane, 3:7) to give 2-bromoethyl 
2,3,4,6-tetra-O-acetyl-a-D-glucopyranoside (4g) (See Figure 6) (1.76 g, 48%) as a 
colorless oil that crystallized on standing to give a white solid; mp 86-88''C; [aj'^o = + 
1 30.6 (c 0.2 1 , CHC1.0; IR (film) 1 749 cm ' (C=0); 'H NMR (400 MHz, CDCI3) 5 2.0 1 , 
2.03, 2.0,7, 2.09 (s x 4, 3H x 4, Ac x 4), 3.51 (I, J 5.9 Hz, 2H, -CHjBr), 3.83 (dt, J^\l.6 

30 Hz, y, 5.8 Hz, IH, -OCHH'-), 3.96(dt,J, 1 1.6 Hz, 5.8 Hz, IH, -OCHHl-), 4.10 (dd, 
2.2 Hz, ^6.4 12.0 Hz, IH. H-6), 4. 14 (ddd, J,j 10.2 Hz, Jy^, 2.2 Hz, J5.4. 4.4 Hz, IH, H-5). 
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4.24 (dd.J,., 4.4 Hz, J,.,- 12.0.Hz, IH, H-6'), 4.84 (dd, J, , 3.8 Hz, J,., 10.3 Hz, IH, H-2), 
5.05 (t. J9.7 Hz, IH, H-4), 5.14 (d. , 3.8 Hz, IH, H-1), 5.49 (dd, J,., 10.3 Hz. J, , 9.5 
Hz. 1 H. H-3); NMR ( 1 00 MHz, CDCl,) 5 20.6, 20.7 (CH.COO- x 4), 29.9 (-CH.Br), 
61.9. 67.8. 68.5, 68.8, 70.0, 70.8 (-OCH.-. C-2, C-3. C-4, Co, C-6), 96.0 (C-l), 169.6, 
5 170.0. 170.2. 170.6 (CH3COO-x4): HRiMS m/z (FAB+): Found 477.038 1 (M+Na'); 
C„.H.,0,oBrNa requires 477.0372. NaSSO.CH, (75 mg, 0.56 mmoi) was added to a 
solution of 4g (See Figure 6) (190 mg. 0.42 mmol) in DMF (6 mL) under N, and warmed 
to 50 '^C. After 21 hours, the solution was cooled and the solvent removed. The residue 
. was purified by flash chromatography (EtOAc : hexane, 1:1) to give Ig (See Figure 6) 

10 (183 mg. 90%) as a colorless oil; 92.1 {c 0.39, CHCl,); IR (film) 1748 cm"' 

(C=0). 1322, 1 134 cm ' (S-SO,); 'H NMR (400 MHz, CDCl,) 6 2.01, 2.03, 2.07, 2.09 (s x 
4, 3H X 4, Acx4), 3.41 (t, J5.7Hz, 2H, -CH,S-), 3.41 (s, 3H, CH.SO,-). 3.75 (dt, J, 10.8 
Hz, J, 5.7 Hz, IH, -OCHH'-), 3.99-4.06 (m, 2H, H-5, -OCHHI-), 4.09 (dd, J, , 2.4 Hz, J,^- 
12.6 Hz. IH, H-6), 4.25 {Ad^J^,- 4.6 Hz, J.^- 12-6 Hz, IH, H-6^), 4.87 (dd, J, , 3.9 Hz, J,., 

15 10.3 Hz, IH, H-2), 5.06 (t, J 9.8 Hz, IH, H-4), 5.12 (d, J,., 3.9 Hz. IH. H-1), 5.43 (t, J9.8 
Hz, IH, H-3); ''C NMR (100 MHz, CDCI3) 5 20.6, 20.6, 20.7, 20.7 (CH.COO- x 4), 36.0, 
(-CH.S-), 50.8 (CH3SO3-), 61.8 (-OCH3-), 67.0, 67.8, 68.3, 69.8, 70.7 (C-2, C-3, C-4, C-5, 
C-6). 96.0 (C-1), 169.5, 170.0, 170.6 (CH.COO- x 4); HRMS m/z (FAB+): Found 
487.0946 (M+H'); Cj^H.^O^S. requires 487.0944. 

20 ^ 

Preparation of 2-(a-D-GlucopyranosyI)eihyl methanethiosulfonate (lb). 

A solution of NaOMe (O.IM, 0.3 mL) was added to a suspension of 4g 

(See Figure 6) (300 mg, 0.66 mmol) in MeOH (3 mL) under N. and stirred vigorously. 

After 6 hours, the resulting solution was passed through a Dowex 50W(H^) plug (2 x 1 
25 cm, eluant MeOH) and the solvent removed to give 2-bromoethyl a-D-glucopyranoside 

bromide (4b) (See Figure 6) (178 mg, 94%) (The synthesis of 4b as an intermediate has 

been described previously. However, this method gave only a poor yield of product. 

Nagai et al., Carbohvdr. Res. , 190:165-180 (1989), which is hereby incorporated by 

reference) as a white solid. NaSSOXH^ (100 mg, 0.75 mmol) was added to a solution of 
30 4b (See Figure 6) (1 78 mg, 0.62 mmol) in DMF (7 mL) under N, and warmed to 50 °C. 

After 25 hours, the solution was cooled and the solvent removed. The residue was 
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purified by flash chromatography (MeOH : EtOAc, 1 :9) to give lb (See Figure 6) (144 
mg, 73%) as a hygroscopic foam; [aj-'p = + 109.9 (c 1 . 1 1, H.^O): IR (film) 3423 cm ' 
(OH), 1309, 1128 cm-' (S-SOj): 'H N'MR (500 MHz, D,0. COSY) 6 3. 16 (t, J 9.5 Hz. IH. 
H-4), 3.28 (I, J 5.9 Hz. 2H, -CH,S-), 3.30 (s, 3H, CH-.SO,-). 3.3 1 (dd, J, , 3.8 Hz. J, , 9.9 
Hz, IH, H-2), 3.44 (t, J9.5 Hz, IH, H-3), 3.47-3.53 (m. 2H. H-6, H-6'), 3.58-3.61 (m. 
1 H. H-5), 3.62 (dl, J, 5.4 Hz. 10.8 Hz, 1 H, -OCHH -). 3.79 (dt, J, 6.3 Hz, J, 10.8 Hz. 
IH, -OCHHi), 4.72 (d,^, , 3.8 Hz, IH, H-1); ' 'C NMR (100 MHz. D,0) 6 36.7 (-CH,S-). 
50.7 (CH.SO,-), 61.5 (-OCH,-), 67.2, 70.5, 72.3, 73.2. 74.0 (C-2. C-3, C-4, C-5, C-6), 
99.4 (C-1); HRMS m/z (FAB+); Found 319.0517 (M+H"): C<,H„08S3 requires 319.0521 . 

Preparaiion of2-(2, 3. 4, 6-Tetra-0-acetyl-^-D-glucopyranosyl)ethyl melhanethiosulfonaie 
(Ih). 

BFvEt20 (3.3 mL, 26.0 mmol) was added dropwise over the course of 15 
minutes to a solution of 1 ,2 J,4,6-penta-0-acetyl-a,P-D-glucose (2 g, 5.1 mmol) and 
Br(CH2)20H (0.45 mL, 6.3 mmol) in CH^Cl, (9 mL) at 0 °C under N,. After 1 .5 hours, 
the solution was warmed to room temperature. Aften20 hours the reaction solution was 
added to ice water (15 mL) and extracted with CHXU (1 5 mL x 3). These extracts were 
combined, washed with water (15 mL), sat. NaHCO^ (aq., 15 mL), water (15 mL), dried 
(MgSOJ, filtered, and the solvent removed. The residue was purified by flash 
chromatography (EtOAc : hexane, 1:3) to give 2-bromoethyl 23,4,6-tetra-O-acetyl-P-D- 
glucopyranoside (4h) (See Figure 6) (1 .42 g, 61%) as a white solid; mp 1 18-120 °C 
(EtOAc//50-octane) [lit., (Coles et ai., J. Am. Chem. Soc , 60:1020-1022 (1938), which is 
hereby incorporated by reference) mp 1 17.3 °C (EtOH)]; [aj-'^ = - 1 1.9 (c 1.65, CHCl,) 
[lit., (Helferich et al.. Just. Lieb. Ann. Chem. , 54 1 : 1 - 1 6 ( 1 939), which is hereby 
incorporated by reference) [a]-°D = - 12.3 (c 0.2, CHCl,)]; 'H NMR (200 MHz, CDCl,) 5 
2.00, 2.02, 2.07, 2.09 (s x 4, 3H x 4, Ac x 4), 3.42-3.5 1 (m. 2H), 3.67-3.87 (m, 2H), 4. 1 0- 
4.31 (m, 3H), 4.57 (d, y, , 8 Hz, IH, H-1), 4,97-5.27 (m, 3H). NaSS02CH3 (260 mg, 1.94 
mmol) was added to a solution of 4h (See Figure 6) (640 mg, 1 .41 mmol) in DMF (18 
mL) under and warmed to 50°C. After 25 hours, the solution was cooled and the 
solvent removed. The residue was purified by flash chromatography (EtOAc : hexane, 
1:1) and the resulting solid recrystallized from EtOAc/hexane to give Ih (See Figure 6) 
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(544 mg. 80%) as a white solid; mp 1 15-1 16°C (EtOAc/hexane); [af^ = + 5.4 (c 1 .06. 
CHCU): IR (KBr) 1758. 1741 cm"' (C=0), 1314. 1 133 cnv' (S-SO,); 'H NMR (500 MHz. 
CDCl,. COSY) 5 1 .99. 2.02. 2.06, 2.08 (s x 4. 3H x 4. Ac x 4), 3.30-3.38 (m. 2H. -CH,S- 
). 3.34 (s. 3H. CH-,SOr), 3.70 (ddd, 9.9 Hz. J,, 2.2 Hz. J,.,- 4.6 Hz. 1 H, H-5). 3.83 

5 (ddd, J 5.6 Hz, J 7.4 Hz. J 1 0.5 Hz, 1 H, -OCHH'-). 4. 1 3-4. 1 8 (m, 2H, H-6, -OCHHT-). 
4.24(dd.A, 4.6 Hz, J,,- 12.4 Hz. IH, H-6'). 4.55 (d, J, , 8.1 Hz, IH, H-1), 4.98 (dd. J; , 
8.1 Hz. J:.3 9.7 Hz, lH,H-2), 5.07 (t. J9.9Hz, IH. H-4), 5.19(t, J9.6 Hz, IH. H-3): "C 
NMR (125 MHz, CDCl,, DEPT) 5 20.5. 20.7 (q x 2, CH,COO- x 4), 36.0, (t. -CH,S-). 
50.6 (q. CH,SO,-), 61 .6 (t, -OCH,-), 68. 1 . 70.8, 71 .9. 72.5 (d x 4, C-2, C-3, C-4. C-5), 

10 68.4 (t. C-6). 100.8 (d, C-1), 169.3, 170.0. 170.5 (s x 3, CH;,COO- x 4); HRMS m/r 
(FAB+): Found 487.0940 (M+H"); C|7H,,0„S, requires 487.0944. 

Preparation of 2-(^-D-Glucopyranosyl)ethyl melhanethiosulfonate (Ic). 

A solution of NaOMe (O.IM. 0.3 mL) was added to a suspension of 4h 

1 5 (See Figure 6) (300 mg, 0.66 mmol) in MeOH (3 mL) under N. and stirred vigorously. 
After 4 hours, the resulting solution was passed through a Dowex 50W(H'^) plug (2 x 1 
cm, eluant MeOH) and the solvent removed to give 2-bromoethyl p-D-glucopyranoside 4c 
(See Figure 6) (176 mg, 93%) as a white solid that was used directly in the next step. A 
sample was recrystallized from EtOH/EtOAc to give a colorless, crystalline solid; mp 74- 

20 ■ 78°C (EtOH/EtOAc) [lit., (Helferich et al .Inst. I.ieb. Ann. Chem. . 541:1-16 (1939), 
■ which is hereby incorporated by reference) mp 74-75°C (EtOH/EtOAc)]; [a]-^D = - 22.4 
(c 1.63, HjO) [lit., (Helferich et al.. Just. Lieb. Ann. Chem. , 541:1-16 (1939), which is 
hereby incorporated by reference) [aj^p = - 26.1 (c 3.0, H,0)]; 'H NMR (400 MHz, 
CD3OD) 5 3.30 (t, J 8.4 Hz, IH, H-2), 3.39-3.49 (m, 3H), 3.64-3.80 (m, 3H), 3.97 (br d, 

25 Je.e- 1 1 -7 Hz, 1 H, H-6'), 4.02 (dt, J, 6.5 Hz, 7^ 1 1 -3 Hz, 1 H, -OCHH'-), 4.23 (dt, J, 6.5 Hz, 
J, 1 1.3 Hz, IH, -OCHH:-), 4.44 (d, J,., 7.9 Hz, IH, H-1). NaSSOjCH, (100 mg, 0.75 
mmol) was added to a solution of 4c (See Figure 6) (176 mg, 0.61 mmol) in DMF (7 mL) 
under N, and warmed to 50°C. After 15 hours, the solution was cooled and the solvent 
removed. The residue was purified by flash chromatography (MeOH : EtOAc, 1 :9) to 

30 give Ic (See Figure 6) (144 mg, 74%) as a hygroscopic foam; [af'o = - 1 5.8 (c 0.88, 
H,0); IR (KBr) 3400 cm'' (OH), 1310, 1 131 cm ' (S-SO,); 'H NMR (500 MHz, D,0, 
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COSY) 5 3.07 (dd, J, , 8.1 Hz, J, , 9.4 Hz, IH, H-2), 3.16 (dd, J,, 9.0 Hz, J, , 9.8 Hz, 1 H, 
H-4), 3.24 (ddd,7, j9.8 Hz. J^, 6.0 Hz, 75 6-2-3 Hz, 1 H, H-5), 3.27 (t, 7 9.0 Hz, IH, H-3). 
3.30-3.33 (m, 2H, -CH.S-). 3.34 (s. 3H, CH.SO,-), 3.50 (dd, 7,^ 6.0 Hz, J^^ 12.4 Hz, IH. 
H-6), 3.69 (dd,^^,. 2.3 Hz, 7,, 12.4 Hz, IH, H-6"), 3.81 (dt, 7, 5.8 Hz, Jj 11.5 Hz, IH, - 
5 OCHH^-), 4.00 (dt, J, 5.7 Hz, J, 1 1 .4 Hz, IH, -OCHH:-), 4.30 (d, , 8.1 Hz. IH, H-I): 
NMR(50 MHz, D,0) 6 36.9 (-CH,S-), 51.0 (CH.SO,-), 62.0 (-OCH,-), 69.5. 70.9, 
74.3, 76.7, 77.3 (C-2, C-3, C-4, C-5. C-6), 103.7 (C-1); HRMS m/z (FAB+): Found 
341.0351 (M+Na*); QHigOgS.Na requires 341.0341. 

1 0 Preparation of2-(2, 3, 4, 6- Tetra-O-acetyl-a-D-mannopyranosyljelhyl 
methanethiosulfonate (li). 

BFj.EtjO (7.7 mL, 60.7 mmol) was added dropwise over the course of 1 5 
minutes to a solution of l,2,3,4,6-penta-0-acetyi-a,P-D-mannose (5d) (See Figure 7) (4.7 

15 g, 12.1 mmol) and Br(CH,),OH (1.05 mL, 14.8 mmol) in CH,C1, (22 mL) at 0°C under 
Nj. After 1 hour, the solution was warmed to room temperature. After 25 hours,Hhe 
reaction solution was added to ice water (20 mL) and extracted with CH^Cl, (20 mL x 2). 
These extracts were combined, washed with water (20 mL), sat. NaHCO, (aq., 20 mL), 
water (20 mL), dried (MgS04), filtered, and the solvent removed. The residue was 

20 crystallized from EtOAc//5o-octane to give 2-bromoethyl 2,3,4,6-tetra-O-acetyl-a-D- 
mannopyranoside (4i) (See Figure 7) (3.52 g, 64%). Purification of the resulting mother 
liquor by flash chromatography (EtOAc:hexane, 1:3) gave further 4i (See Figure 7) (320 
mg, 6%; 70% in total) as a white highly crystalline solid; mp 121-I23''C [lit., (Dahmen et 
aL, "2-Bromoethyl Glycosides - Synthesis and Characterization." Carbohvdr. Res. . 

25 1 1 6:303-307 (1983), which is hereby incorporated by reference) 11 8- 11 9''C (EtOAc/wo- 
octane)]; [o.f'o = + 48.3 (c 1.31, CHCij) [lit., (Dahmen et al., "2-Bromoethyl Glycosides 
- Synthesis and Characterization," Carbohvdr. Res. . 1 16:303-307 (1983), which is hereby 
incorporated by reference) [af'^ = + 45 (c 0.6, CDCI3)]; 'H NMR (200 MHz, CDCI3) 5 
1.99, 2.05, 2.10, 2.16 (s X 4, 3H x 4, Ac x 4), 3.52 {i,J6 Hz, 2H, -CH,Br), 3.82-4.04 (m, 

30 2H, -OCH,-), 4.09-4.16 (m, IH, H-5), 4.13 (dd, 2 Hz, J^.e- 12 Hz, IH, H-6) 4.28 (dd, 
JsM- 6 Hz, y^fi. 12 Hz, IH, H-6'). 4.87 (br s, IH, H-1), 5.22-5.40 (m, 3H, H-2, H-3, H-4). 
NaSSOjCHj (230 mg, 1 .72 mmol) was added to a solution of 4i (See Figure 7) (600 mg. 
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1 .32 mmol) in DMF (17 mL) under N, and warmed to 55°C. After 20 hours, the solution 
was cooled and the solvent removed. The residue was purified by flash chromatography 
(EtOAc : hexane. 9: 11) and the resulting solid recrystallized from Et,0/hexane to give li 
(See Figure 7) (566 mg, 88%) as a white solid; mp 128-129°C (Et^O/hexane); [a]-\^ = - 

5 53.2 (c 0.92. CHCl,); IR (KBr) 1739 cm-' (C=0), 1325. 1129 cm ' (S-SO,); 'H NMR (500 
MHz. CDCI5. COSY) 5 1.97, 2.04. 2.09. 2.14 (s x 4. 3H x 4, Ac x 4), 3.37-3.40 (m. 211. - 
CH,S-), 3.38 (s, 3H, CH.SO,-), 3.79 (dt. J, 10.5 Hz, J, 5.8 Hz, IH, -OCHH"-), 3.98-4.03 
(m, 2H, -OCHH:-, H-5). 4.09 (dd, J, , 2.5 Hz, A., 12.5 Hz. 1 H, H-6), 4.26 (dd. 5.6 
Hz, J,.,. 12.5 Hz. IH. H-6'), 4.85 (d, J,,, 0.7 Hz. IH. H-1), 5.23-5.29 (m, 3H, H-2. H-3. H- 

10 4); '-"C NMR (50 MHz. CDCI3) S 20.6, 20.7, 20.8 (CH,COO- x 4). 35.7, (-CH,S-), 50.8 
(CH,SO,-), 62.5 (-OCH,-), 66.0, 66.8, 69.0, 69.2, 69.3 (C-2, C-3, C-4, C-5, C-6), 97.7 (C- 
1), 169.7, 169.9, 170.0, 170.6 (CH-COO- x 4); HRMS m/z (FAB+): Found 487.0954 
(M+H^; C,7H,70,,S2 requires 487.0944. 

1 5 Preparation of 2-{o.-D-Mannopyranosyl)ethyl methane thiosulfonate ( Id). 

A solution of NaOMe (0.143 M, 0.7 mL) was added to a suspension of 4i 
(See Figure 7) (1 g, 2.2 mmol) in MeOH (10 mL) under N,. After 3 hours, the resulting 
solution was passed through a Dowex 50W(H0 plug (2 x 1 cm, eluant MeOH) and the 
solvent removed. The residue was purified by flash chromatography (MeOH : EtOAc, 

20 ..;:.2:25) to give 2-bromoethyl a-D-mannopyranoside 4d (See Figure 7) (The use of 4d as a 
reactant has been described previously, although no details of preparation or 
characterization were given. (U.S. Patent 4 918 009 to Nilsson, which is hereby 
incorporated by reference)) (606 mg, 96%) as a white foam; [a]^*D = + 50.7 (c 0.91 , H,0); 
IR (KBr) 3417 cm"' (OH); 'H NMR (500 MHz, D,0, COSY) 6 3.38-3.44 (m, 3H, H-4, - 

25 CHjBr), 3.50-3.55 (m, 2H, H-5, H-6), 3.60 (dd, J.., 3.5 Hz, J3.4 9-7 Hz, IH, H-3), 3.66 (dd, 
J,„. 4.6 Hz, y^.,- 1 1 -2 Hz, 1 H, H-6'), 3.68 (ddd, J4.6 Hz, J 5.4 Hz, J U .7 Hz, IH, - 
OCHH'-), 3.76 (dd, J,., 1 .8 Hz, My 3.5 Hz, 1 H, H-2), 3.81 (ddd, J 5.1 Hz, J 6.5 Hz, J 
11.7, 1H,-0CHH:-),4.71 (d,J,2 1.8 Hz, IH, H-1); ''C NMR (100 MHz, D,0) 5 32.1 (- 
CHjBr), 61.7 (-OCH^-), 67.5, 68.4, 70.7, 71.3, 73.8 (C-2, C-3, C-4, C-5, C-6), 100.5 (C- 

30 1 ); HRMS m/z (FAB+): Found 308.9985 (M+Na^; CgHijO/'BrNa requires 308.9950. 
NaSSOjCHj (150 mg, 1 .12 mmol) was added to a solution of 4d (See Figure 7) (245 mg. 
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0.85 mmol) in DMF (10 mL) under N. and warmed lo SO^'C. After 16 hours, the solution 
was cooled and the solvent removed. The residue was purified by flash chromatography 
(MeOH : ElOAc, 1 :9) to give Id (See Figure 7) (217 mg, 80%) as a hygroscopic foam; 
[«]''d^^ 58-0 (c 1.34, H.O); IR (KBr) 3441 cm ' (OH), 1314, 1 132 cnV' (S-SO.); 'H 
5 NMR (500 MHz, D.O) 5 3.3 1 (t, J 5.8 Hz. 2H, -CH,S-K 3.35 (s, 3H, CH.SO,-), 3.45 (t, J 
9.6 Hz. 1 H, H-4), 3.49 (ddd, J, , 9.8 Hz, X , 5.8 Hz, J, ,- 1 .9 Hz, 1 H, H-5), 3.55 (dd. A, 5.8 
Hz, J,.,- 12J Hz, IH, H-6), 3.60 (dd, 3.4 Hz, J,., 9.0 Hz, IH, H-3), 3.66 (di, 10.7 Hz, 

5.7 Hz, lH,-OCHH'-), 3.69(dd, ^5 6- 1.9 Hz,J^,- 12.1 Hz, 1 H. H-6'). 3.77 (dd, J, , 1.6 
Hz, J,, 3.4 Hz, IH, H-2), 3.83 (dt, J, 1 1.0 Hz, J, 5.9 Hz, IH, -OCHHl-), 4.72 (d, J,,, 1.6 
10 Hz, IH, H-1); "C NMR (125 MHz, D,0) 6 36.7 (-CH.S-), 50.7 (CH.SO,-), 61 .9 (-OCH,- 
), 66.7, 67.7, 70.9, 71.5, 74.0 (C-2, C-3, C-4, C-5, C-6), 100.8 (C-1); HRMS m/z (FAB+): 
Found 319.0528 (M+H"); C^U.^O^S, requires 3 19.0521. 

Preparation of 2'(2,3,4, 6'Tetra-0-acetyl-^'D~galac(opyranosyl)ethyI 
15 methanethiosulfonate (Ij). 

BF3.Et20 (8.5 mL, 67.0 mmol) was added dropwise to a solution of of 
l,2,3,4,6-penta-(9-acetyl-a,P-D-galactose (5e) (See Figure 7) (5.1 g, 13.1 mmol) and 
Br(CH,).OH (1 .15 mL, 16.2 mmol) in CH3CI, (24 mL) at 0°C under N,. After 1 hour, the 

20 solution was warmed to room temperature. After 24 hours, the reaction solution was 

added to ice water (20 mL) and extracted with CH^CU (30 mL x 3). These extracts were 
combined, washed with water (20 mL), sat. NaHC03 (aq., 20. mL), water (20 mL), dried 
(MgS04), filt Vogtle ered, and the solvent removed. The residue purified by flash 
chromatography (EtOAc : hexane, 1 : 3) to give 2-bromoethyl 2,3,4,6-tetra-O-acetyl-p-D- 

25 galactopyranoside (4j) (See Figure 7) (4.01 g, 67%) as a white solid; mp 116-1 17*^0 
(EtOAc/hexane) [lit., (Coles et al., J. Am. Chem. Soc , 60:1020-1022 (1938), which is 
hereby incorporated by reference) 1 1 1°C; lit., (Dahmen et al., "2-Bromoethyl Glycosides 
- Synthesis and Characterization," Carbohvdr. Res. , 1 16:303-307 (1983), which is hereby 
incorporated by reference) 1 14-1 le^'C (EtOAc/light pet. ether)]; [aj-'o = - 3.8 (c 0.81, 

30 CHCI3) [lit., (Dahmen et al., "2-Bromoethyl Glycosides - Synthesis and 

Characterization," Carbohvdr. Res. . 1 16:303-307 (1983), which is hereby incorporated by 
reference) [a]-^o = - 5 (cl.4, CDCI3)]; 'H NMR (200 MHz, CDCl.,) 5 1.98, 2.05, 2.08. 
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2.1 5 (s X 4, 3H X 4, Ac x 4). 3.43-3.50 (m, 2H), 3.75-3.95 (m, 2H), 4.12-4.24 (m, 3H). 
4.53 (d. J,., 8 Hz, IH, H-1). 5.02 (dd, J, , 1 1 Hz, 7,., 3 Hz, IH, Ho), 5.23 (dd,y, , 8 Hz. 
.A 3 1 1 Hz, 1 H. H-2), 5.40 (br d. J, , 3 Hz, 1 H, H-4). NaSSO.CH, (85 mg, 0.63 mmol) was 
added to a solution of 4j (See Figure 7) (223 mg, 0.49 mmol) in DMF (6 mL) under N. 
5 and warmed to 55°C. After 30 hours, the solution was cooled and the solvent removed. 
The residue was purified by flash chromatography (EtOAc : hexane, 1:1) to give Ij (See 
Figure 7) (198 mg, 83%) as a white foam; [a]''o = + 9. 1 (c 1.41. CHCl,); IR (film) 1747 
cm"' (C=0), 1320. 1 133 cm ' (S-SO.); 'H NMR (500 MHz. CDCl,) 5 1.98,2.05. 2.09. 
2.15 (s X 4, 3H X 4, Ac x 4), 3.35 (s, 3H, CH.SO,-), 3.35-3.38 (m, 2H, -CH,S-), 3.84 

10 (ddd,y6.1 H2,77.1 Hz.7 10.5 Hz, IH, -OCHH'-). 3.92 (td. , 1.1 Hz,/, 6.6 Hz, IH. H- 
5), 4.10-4.21 (m, 3H, H-6. H-6\ -OCHHl-), 4.52 (d, 8.0 Hz. IH, H-1), 5.01 (dd,y., 
10.3 Hz, J,.4 3.5 Hz, IH, H-3), 5.20 (dd, J,. 8.0 Hz, 10.3 Hz, IH, H-2), 5.40 (dd, J,, 
3.5 Hx^J.s 1-1 Hz, IH, H-4); ''C NMR (100 MHz, CDCF,) 5 20.6, 20.7, 20.8 (CH.COO- 
X 4), 36.1, (-CH.S-), 50.6 (CH.SO,-), 61.2 (-OCH,-), 67.0, 68.3, 68.5, 70.8, 71.0 (C-2, C- 

15 3^ C-4, Co, C-6), 101.3 (C-1), 169.5, 170.0, 170.1, 170.4 (CH.COO- x 4); HRMS m/z 
(FAB+): Found 487.0936 (M+H"); C.^H.^O^S, requires 487.0944. 

Preparation of 2-(^-D-Galactopyranosyl)efhyl methanethiosulfonate (le). 

A solution of NaOMe (0.104 M, 0.8 mL) was added to a solution of 4j 

20 (See Figure 7) (778 mg, 1 .7 1 mmol) in MeOH ( 1 0 mL) under N.. After 4 hours, the 
- reaction solution was passed through a Dowex 50W(H^) plug (3 x 1 cm, eluant MeOH) 
and the solvent removed to give 2-bromoethyl p-D-galactopyranoside (4e) (See Figure 7) 
(450 mg, 92%) (The synthesis of unstable 4e has been described previously. Dahmen et 
al., "2-Bromoethyl Glycosides .4.2-Bromoethyl Glycosides in Glycoside Synthesis - 

25 Preparation of Glycoproteins Containing Alpha-L-Fuc-(l->2)-D-Gal and Beta-D-Gal-(1- 
>4)-D-Glcnac," Carbohydr. Res. . 125:237-245 (1984), which is hereby incorporated by 
reference) as a white solid which was used directly in the next step. NaSS02CH3 (180 
mg, 1 .34 mmol) was added to a solution 4e (See Figure 7) (290 mg, 1,01 mmol) in DMF 
(12 mL) under and warmed to 50^C. After 15 hours, the solution was cooled and the 

30 solvent removed. The residue was purified by flash chromatography (MeOH : EtOAc, 
1 :9) to give le (See Figure 7) (229 mg, 71%) as a white foam; [af^ = + 2.9 (c 0.58, . 
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H,0); IR (film) 3358 cm ' (br, O-H), 1306, 1 120 cm ' (S-SO,): 'H NMR (500 MHz, D,0, 
COSY) 5 3.29-3.33 (m, 2H, -CH_,S-)- 3.30 (dd. J, , 7.7 Hz, 7,, 10.0 Hz, 1 H. H-2), 3.35 (s, 
3H, CH,SO,-), 3.43 (dd, J, , 10.0 Hz, J,, 3.6 Hz, 1 H. H-3), 3.48 (ddd, J,_, 0.9 Hz. J, , 4.3 
Hz, Jj, 7.9 Hz. 1 H, H-5). 3.52 (dd, J,^ 4.3 Hz, J,, 1 1 .7 Hz. 1 H, H-6), 3.57 (dd. J,^,- 7.9 
5 Hz, J^e,. 1 1.7 Hz, IH, H-60, 3.70 (dd, J- , 3.6 Hz, J^ , 0.9 Hz, IH, H-4), 3.80 (dt.y^ 1 1.2 
Hz, 7, 6.1 Hz, IH, -OCHH'-), 4.01 {dt.J^ 1 1.4 Hz. J, 5.8 Hz. IH, -QCHH:-), 4.24 (d, 7, , 
7.7 Hz. IH. H-1); ''C NMR (100 MHz. D,0) 5 36.7 (-CH,S-), 50.8 (CH,SO,-), 61.9 (- 
OCH,-), 69.2. 69.6, 71.7, 73.7, 76.2 (C-2. C-3, C-4, C-5, C-6), 104.0 (C-1); HRMS m/z 
(FAB+): Found 319.0523 (M+H'); QH.gOgS. requires 319.0521. 

'0 

Preparation of 2-(2, 3, 6-Tri-0-acetyl-4-0-(2, 3. 4. d-leira-O-acetyl-^-D-galactopyranosyl)- 
^-D-glucopyranosyl) ethyl methanethiosulfonate (Ik). 

BFj.EtjO (4.0 mL, 31.5 mmol) was added dropwise to a solution of 

15 1 ,2,3,6-tetra-0-acetyl-4-0-(2,3,4,6-tetra-0-acetyl-p-D-galactopyranosyl)-[3-D- - 
glucopyranoside (5f) (See Figure 7) (5 g, 7.4 mmol) and Br(CH,),OH (0.65 mLi-9.2 
mmol) in CHjCl, (15 mL) at 0°C under N,. After 1 hour, the solution was warmed to 
room temperature. After 20 hours, the reaction solution was added to ice water ( 1 5 mL) 
and extracted with CHjCl, (20 mL x 2). These extracts were combined, washed with 

20 water (20 mL), sat. NaHCO.-, (aq., 20 mL), water (20 mL), dried (MgSOJ, filtered, and the 
solvent removed. The residue was purified by flash chromatography (EtOAc : hexane, 1 : 
1) to give 2-bromoethyl 2,3,6-tri-0-acetyl-4-C>-(2,3,4,6-tetra-0-acetyl-P-D- 
galactopyranosyl)-(J-D-glucopyranoside (4k) (See Figure 7) (2.94 g, 53%) as a white 
foam; [a.f\ = - 7.8 (c 1.28, CHCI3) [lit., (Dahmen et al., "2-Bromoethyl Glycosides - 

25 Synthesis and Characterization," Carbohvdr. Res.. 1 16:303-307 (1983), which is hereby 
incorporated by reference) [af'^o = - 1 1 (c 1 .3, CHCl,)]; 'H NMR (500 MHz, CDCl,, 
COSY) 8 1 .94, 2.02, 2.02 (s x 3, 3H x 3, Ac x 3), 2.04 (s, 6H, Ac x 2), 2. 1 0, 2. 1 3 (s x 2, 
3H X 2, Ac X 2), 3.38-3.46 (m, 2H, -CH,Br), 3.59 (ddd, Jy ^,-- 2.2 Hz, J 4.9 Hz, J 9.9 Hz, 
IH, H-5'), 3.75-3.80 (m, 2H, H-4', -OCHH'-), 3.85 (td,J,.j 1.1 Hz, J, 6.9 Hz, lH,H-5), 

30 4.03-4.12 (m, 4H, H-6. H-6', H-6", -OCHHl-), 4.45 (d, J,., 7.8 Hz, IH, H-1), 4.48 (dd, 
Jy^.2.2 Hz,J6..6-.. 12.1 Hz, IH, H-6'"), 4.50{d,7,-,- 7.9 Hz, lH,H-r),4.89 (dd,J,-.,- 7.9 
Hz, Jj J 9.6 Hz, IH, H-2'), 4.92 (dd, J,., 10.5 Hz. ^4 3.4 Hz, IH, H-3), 5.08 (dd, J, , 7.8 
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Hz, J, , 10.5 Hz, IH, H-2), 5.18 ay9.6Hz, IH. H-3"), 5.32 (dd, 3.4 Hz, , 1.1 Hz, 
IH, H-4). NaSSO,CH, (87 mg, 0.65 mmol) was added to a solution of 4k (See Figure 7) 
(357 mg, 0.48 mmol) in DMF (6 mL) under and warmed to 50 °C. After 22 hours, the 
solution was cooled and the solvent removed. The residue was purified by flash 
5 chromatography (EtOAc : hexane. 1 1 :9) to give Ik (See Figure 7) (327 mg, 88%) as a 
white foam; [a]"p = -3.7(c 1.0, CHCl,); IR (KBr) 1751 cm ' (C=0), 1323, 1134 cm ' (S- 
SO,); 'H NMR (500 MHz. CDCl,, COSY) 5 1.94, 2.02. 2.02. 2.04. 2.04, 2.1 1, 2.13 (s x 7. 
3H X 7, Ac X 7), 3.29-3.40 (m. 2H. -CH,S-), 3.32 (s, 3H. CH.SO,-). 3.59 (ddd, J,. y 9.9 
Hz, ,.. 4.9 Hz, J,- ,... 2.2 Hz, IH. H-5'), 3.77 (t, 7 9.5 Hz. IH, H-4-). 3.79-3.86 (m. 2H, 

10 H-5, -OCHH'-), 4.03-4.13 (m, 4H, H-6, H-6\ H-6", -OCHHl-), 4.46 (d, J, , 7.8 Hz, IH, 
H-1), 4.50 (d,^,..,. 8.0 Hz, IH, H-T). 4.52 {dd, Jy^-^. 2.2 Hz,J^-,- 11.9 Hz, IH, H-6'"), 
4.87 (dd, J,. ,. 8.0 Hz, 7,.,. 9.6 Hz, IH, H-2'), 4.93 (dd, J, , 10.5 Hz,y,, 3.5 Hz, IH, H-3), 
5.08 (dd, J,.2 7.8 Hz, 7, . 10.5 Hz, IH, H-2), 5.17 (t, J9.3Hz, IH, H-3^), 5.32 (dd.J,, 3.5 
Hz, J, 5 1 .0 Hz, 1 H, H-4); '^C NMR ( 1 25 MHz, CDCl,, DEPT) 5 20.5, 20.7, 20.8, 20.9 (q 

15 X 4, CHjCOO- X 7), 36.0 (t, -CH_,S-). 50.6 (q, CH,SO,-), 60.7, 61 .6, 68.5 (t x 3, -OCH,-, 
C-6, C-6' ), 66.5, 69.0, 70.6, 70.9, 71.3, 72.6, 72.8, 76.0 (d x 8, C-2, C-3, C-4, C-5, C-2'. 
C-3', C-4', C-5'), 100.7, 101.1 (d x 2, C-KC-l'), 169.1, 169.7, 169.7, 170.1, 170.2, 
170.3, 170.4 (s X 7, CH3COO- X 7); HRMS m/z (FAB+): Found 775.1793 (M+H'); 
C29H43O20S2 requires 775.1789. 

20 

Preparation of 2-( 4-0-^-D-Galactopyranosyl-^-D-glucopyranosyl)ethyl 
methanethiosulfonate (If). 

A solution of NaOMe (0.1 M, 0.6 mL) was added to a solution of 4k (See 
25 Figure 7) (877 mg, 1.18 mmol) in MeOH (6 mL) under N,. After 3 hours, the reaction 
solution was passed through a Dowex 50W(H'^) plug (4 x 1 cm, eluant MeOH) and the 
solvent removed to give 2-bromoethyl 4-O-P-D-galactopyranosyl-P-D-glucopyranoside 
(4f) (See Figure 7) (476 mg, 90%) as a white foam which was used directly in the next 
step. NaSSO,CH3 (185 mg, 1.38 mmol) was added to a solution of 4f (See Figure 7) (476 
30 mg, 1 .06 mmol) in DMF (24 mL) under and warmed to 50°C. After 2 1 hours, the 
solution was cooled and the solvent removed. The residue was purified by flash 
chromatography (CHCl, : MeOH : AcOH : H,0, 60:30:3:5) to give If (See Figure 7) (346 
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mg. 68%) as a hygroscopic foam; [a]\ = -f 1.5 (c 1.66, H,0); IR (KBr) 3416 cm ' (br. 

OHX 1311. 1131 cm ' (S-SO,): 'H NMR (400 MHz, DA COSY) 5 3.10-3.13 (m, IH. H- 

2). 3.30 fi../6.0 Hz. 2H -CH,S-). 3.31 (dd. 7.8 Hz. J,,,. 10.3 Hz. IH, H-2 = ). 3.34 (s. 

3H, CH,SO,-). 3.38-3.54 (m, 5H). 3.44 (dd,.A 10.3 Hz, J, ,. 3.3 Hz, IH, H-3'). 3.57 

(dd, J8.4 Hz, J 1 1.4 Hz, IH), 3.59 (dd.y4.9 Hz,./ 7.3 Hz. IH), 3.70 (br d, J,,. 3.3 Hz. 

IH, H-4-). 3.77(dd. J 1.0 Hz, J 1 1.5 Hz, IH), 3.79-3.83 (m, IH, -OCHH'-). 3.97-4.02 (m. 

IH, -OCHH:-), 4.22 (d,7,,,. 7.8 Hz, IH, H-l'), 4.33 (d, J,, 7.8 Hz, IH, H-1); '^C NMR 
(125 MHz, D,0) 6 36.7 (-CH,S-), 50.8 (CH,SO,-), 61.0, 62.1, 69.4, 69.6, 71.9. 73.5. 73.7, 
75.3, 75.9. 76.4, 79.3 (-OCH,-, C-2, C-3, C-4, C-5, C-6, C-2', C-3\ C-4', C-5\ 0-6"), 
103.3, 103.9 (C-1, C-l'); HRMS m/z (FAB+); Found 503.0886 (M+Na'); C,5H,30,,S,Na 
requires 503.0869. 

Example 2 - General Procedure for Modification of Subtilisin Baci//us lentus 
("SBL") Mutants Stored as Flash-Frozen Solutions 

A 1.25 mL frozen aliquot of the mutant enzyme (SBL-N62C, -L2 1 7C, or 
-SI 66C) containing approximately 25 mg of enzyme was thawed and added to 1 .25 mL of 
Modifying Buffer (see below) in a polypropylene test-tube. To this solution was added 
100 jiL of a 0.2 M glyco-MTS reagent solution (la,g-k in MeCN, lb-fin water (See 
Figure 8)). The mixture was sealed, vortexed, and placed on an end-over-end rotator at 
room temperature. When the modification was complete (determined by a specific 
activity assay, using succinyl-AlaAlaProPhe-;7-nitroanilide [e„o = 8800 M ' cm '] 
(Bonneau et al., "Alteration of the Specificity of Subtilisin BPN' by Site-Directed 
Mutagenesis in its SI and 81' Binding-Sites," J. Am. Chem. Snc 11 9: 1026- 1030 (1991). 
which is hereby incorporated by reference) as substrate in O.I M Tris-HCl buffer 
containing 0.005% Tween 80, 1% DMSO, pH 8.6 showing constant activity and titration 
with Ellman's reagent (e„, = 13600 M ' cm ') (Ellman et al., Biochem. Pharmar.nl 7:88- 
95 (1961), which is hereby incorporated by reference) showing no free thiol present in 
solution), a further 50 )xL of the modifying reagent solution was added and the mixture 
placed back on the end-over-end rotator for a further 1 0 minutes. The reaction was 
poured onto a pre-packed, pre-equilibrated G-25 Sephadex® PDIO column and eluted with 
3.5 mL Quench Buffer (see below). The eluant was dialysed at 4»C against 10 mM MES, 
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20 



- 34 - 

1 mM CaCU pH 5.8 (2 x IL. 2 x 45 minutes). The resulting diaiysate was flash frozen in 
liquid nitrogen and stored at -1 8°C. 

N4odifying Buffer; pH 9.5: 140 mM CHES, 2 mM CaCU 
pH7.5: 140mMHEPES.2mMCaCl, 
pH6.5; 140 mM MES.2 mM CaCK 
pH 5.5: 140 mM MES. 2 mM CaCl, 

Quench Buffer: Reactions at pH 7.5 - 9.5: 5 mM MES 1 mM CaCU pH 6.5 
Reactions at pH 5.5: 5 mM MES 1 mM CaCU pH 5.5 



The free thiol content of all chemically modified mutant enzymes ("CMMs"), was 
determined spectrophotometrically by titration with Eilman's reagent (Ellman et al., 
Riochem. Pharmacol. . 7:88-95 (1961), which is hereby incorporated by reference) in 
phosphate buffer 0.25 M, pH 8.0. In all cases, no free thiol was detected. Modified 
enzymes were analyzed by nondenaturing gradient (8-25%) gels at.pH 4.2, run towards 
the cathode, on the Pharmacia Phast-system and appeared as a single band. Prior to ES- 
MS analysis, CMMs were purified by FPLC (BioRad, Biologic System, Hercules, CA) on 
a Source 15 RPC matrix (17-0727-20 from Pharmacia, Bridgewater, NJ) with 5% 
acetonitrile, 0.01% TFA as the running buffer and eluted with 80% acetonitrile, 0.01% 
TFA in a one step gradient. MS m/z (ES-MS): N62C-S-a (See Figure 8) calculated 
27049, found 27051 ; N62C-S-b (See Figure 8) calculated 26925, found 26928; N62C-S-C 
(See Figure 8) calculated 26925, found 26928; N62C-S-d (See Figure 8) calculated 
26925, found 26925; N62C-S-e (See Figure 8) calculated 26925, found 26925; N62C-S-f 
25 (See Figure 8) calculated 27087, found 27087; N62C-S-g (See Figure 8) calculated 

27093, found 27096; N62C-S-Et-p-Glc(Ac), calculated 27009, found 2701 5; N62C-S-Et- 
P-Glc(Ac), calculated 27051, found 27053; N62C-S-i (See Figure 8) calculated 27093, 
found 27098; N62C-S-El-p-Gal(Ac)5 calculated 27051, found 27051; N62C-S-k (See 
Figure 8) calculated 27381, found 27386; L217C-S-P-Glc calculated 26882, found 26879; 
30 L2 1 7C-S-p-Glc( Ac), calculated 26966, found 26962; L2 1 7C-S-p-Glc( Ac)3 calculated 
27008, found 27006; L217C-S-b (See Figure 8) calculated 26926, found 26928; L217C- 
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S-c (See Figure 8) calculated 26926, found 26925; L2 1 7C-S-d (See Figure 8) calculated 
26926, found 26925; L2i7C-S-e (See Figure 8) calculated 26926, found 26928; L21 7C- 
S-f (See Figure 8) calculated 27088^ found 27087; L2 1 7C-S-Et-a-Glc(Ac), calculated 
27010. found 27012: L21 7C-S-Et-p-Glc( Ac), calculated 27052, found 27056; L217C-S- 
Et-a-Man(Ac), calculated 27052, found 27056: L21 7C-S-Et-P-Gal(Ac), calculated 
27052. found 27053; L217C-S-Et-Lac(Ac), calculated 27340. found 27342; SI66C-S-a 
(See Figure 8) calculated 27076, found 27080; S166C-S-b (See Figure 8) calculated 
26952, found 26955; S166C-S-C (See Figure 8) calculated 26952, found 26950; S166C-S- 
d (See Figure 8) calculated 26952, found 26952; SI 66G-S-e (See Figure 8) calculated 
26952, found 26952; S166C-S-f (See Figure 8) calculated 271 14, found 271 12; S166C-S- 
Et-a-GIc(Ac), calculated 27078, found 27078; S166C-S-Et-p-Glc(Ac), calculated 27036, 
found 27040; S 1 66C-S-Et-p-Glc(Ac)3 (major) with S166C-S-h (See Figure 8) (minor) 
and S166C-S-Et-P-Glc(Ac), (minor) calculated 27078 (major), 27120 (minor), 27036 
(minor), found" 27081 (major), 27121 (minor), 27036 (minor); S166C-S-Et-a-Man(Ac), 
calculated 27078, found 27085; S166C-S-Et-P-Gal(Ac), calculated 27078, found 27079; 
S 1 66C-S-Et-Lac(Ac)j calculated 27324, found 2733 1 . 

Example 3 - General procedure for modiflcation of SBL mutants stored as 
lyophilized powders 

This procedure was only used with S 1 56C, which is stored as a lyophilized 
powder to prevent dimerization. Into a polypropylene test tube was weighed about 25 - 
30 mg of lyophilized SI 56C. This was dissolved in the following modifying buffers (2.5 
mL): 

pH9.5: 70 mM CHES, 2 mM CaCl, 
pH7.5: 70 mM HEPES, 2 mM CaCU 
pH6.5: 70 mM MES, 2 mM CaCI, 
pH5.5: 70 mM MES, 2 mM CaCI, 

Glyco-MTS reagent was added and the reaction then proceeded as for the other mutants, 
using the appropriate quench buffer. MS m/z (ES-MS): S156C-S-a (See Figure 8) 
calculated 27076, found 27079; S156C-S-b (See Figure 8) calculated 26952, found 
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26955; S156C-S-C (See Figure 8) calculated 26952, found 26952; S156C-S-d (See Figure 
8) calculated 26952, found 26952; S156C-S-e (See Figure 8) calculated 26952, found 
26952; S156C-S-f (See Figure 8) calculated 271 14, found 271 15; S156C-S-g (See Figure 
8) calculated 27120, found 27123; SI56C-S-h (See Figure 8) calculated 27120, found 
5 27122; S156C-S-i (See Figure 8) calculated 27120, found 27123; S156C-S-j (See Figure 
8) calculated 27120, found 27120; S156C-S-k (See Figure 8) calculated 27408. found 
27411. 

ExampJe 4 - Contents of Acetylated Glyco-CMM Libraries 

10 

The levels of acetylation of giyco-CMMs after modification of SBL 
cysteine mutants with la, g-k (See Figure 8) at various pH levels were determined and are 
set forth in Tables 1 and 2, below. 

15 Table 1. Levels of Acetylation of Glyco-CMMs after Modification of SBL Cysteine 
Mutants with la at various pH^ 

X^eagent 

Enzym>\ pH 9.5 pH 7.5 pH 5.5 

N62C 4 ~- 4 

S156C 4 - 4 

S166C 4 - 4 

L217C 0 2' 3' 

^ Isolated as single species unless indicated. ^ Single product mass by ES-MS. 



BNSDOCID: <WO_0001712A2J_> 



wo 00/01712 



PCT/US99/15138 



-37- 

Table 2. Levels of Acetylation of Glyco-CMMs after Modification of SBL Cystei 
Mutants with la,g-k at pH S.S"" 



Reagent 
Enzyme\ 


la 


Ig 


Ih 


li 


Ij 


Ik 


N62C 


4 


4 


J 


4 


3" 


7 


S156C 


4 


4 


4 


4 


4 


7 


SI66C 


4 




3',4'',2' 


■ 3' 


-.b 
J 


5" 


L217C 


3" 


2" 


-lb 
J 


3" 


3" 


6" 



' Isolated as single species unless indicated. 
" Single product mass by ES-MS. 
5 ' Major component. 
Minor component. 

Example 5 - Incubation of L217C-S-p-Glc(Ac)3 at pH 9.5 

'° "^he general procedure for modification of SBL mutants stored as flash- 

frozen solutions was used to incubate 1.26 mg of L217C-S-P-Glc(Ac)3 as a 0.5 mL 
aliquot in the absence of MTS reagent for 2 hours to give L2 1 7C-S-p-Glc as the sole 
product. MS /n/z (ES-MS): L217C-S-P-Glc calculated 26882, found 26885. 

1 5 Example 6 - Active Site Titrations 

The active enzyme concentration was determined as previously described 
(Hsia et al., "Active-Site Titration of Serine Proteases Using a Fluoride-Ion Selective 
Electrode and Sulfonyl Fluoride Inhibitors." Anal. Biochem. . 242:221-227 (1996), which 
10 is hereby incorporated by reference) by monitoring fluoride release upon enzyme reaction 
with a-toluenesulfonyl fluoride (PMSF) as measured by a fluoride ion sensitive electrode 
(Orion Research 96-09). The active enzyme concentration determined in this way was 
used to calculate k^^ values for each CMM. 



BNSOOaD: <WO_0001712A2J_> 



wo 00/01712 



- 38 - 



PCT/US99/15138 



Example 7 - Kinetic Measurements 

Michaelis-Menten constants were measured at 25(± 0.2)° C by curve 
fining (GraFit' 3.03. Erithacus Software Ltd.. Staines. Middlesex. UK) of the initial rate 
5 data determined at nine concentrations (0.125 mM-3.0 mM) of succin\i-AAPF-/7N A 
substrate in 0,1 M Tris-HCl bulTer containing 0.005% Tween 80. 1% dimeth\ Isufoxidc 
("DMSO"). pH 8.6 8800 \4 ' cnV' ) ( Bonneau et a!., J. Am. Chem. Soc . 1 10; 1026- 

1030 (1991 ). which is hereby incorporated by reference). 

10 Example 8 - Controlled Site Selective Glycosylation of Proteins by a Combined Site- 
Directed Mutagenesis and Chemical Modification Approach 

Four SBL sites at different locations and of different characteristics were 
selected for mutation to cysteine in order to provide a broad test of the glycosylation 

15 methodology- SI 56 of the S, -pocket (Nomenclature of Schechter: Berger. Biochem. 

Biophvs. Res. Commun. . 27:157-162 (1967), which is hereby incorporated by reference) 
is a surface-exposed residue that permits the introduction of externally-disposed glycans 
mirroring those found naturally in glycoproteins ( Molecular Glvcobiologv . Fukuda et aL, 
Eds., Oxford University. Oxford (1994), which is hereby incorporated by reference). In 

20 contrast, N62 in the S. pocket. S166 in the S, pocket, and L217 in the S,' pocket have side 
chains which are internally oriented and test the applicability of the method for 
introducing sugars at hindered locations. Broad applicability with respect to the sugar 
moiety was evaluated by using the representative series of protected and deprotected. 
mono- and disaccharide methanethiosulfonates ('^MTS'') la-k (see Figure 8). These were 

25 prepared from their parent carbohydrates in good to excellent yields (Figures 6 (Reagents 
and Conditions: (i) Ac.O, py then HBr, AcOH; (ii) NaSSO.CH,. EtOH. 90° C; 
(iii) Br(CH.),OH. BF,.Et,0 then Ac,0, py; (iv) NaSSO.CH,, DMF. 50° C: (v) NaOMe. 
MeOH; (vi) Ac.O, py then Br(CH.).OH, BF,.Et,0. DCM) and 7 (Reagents and 
Conditions: (i) Ac.O, py. 92% for 5d, 99% for 5e; Ac.O. NaOAc, 82% for 5f; 

30 (ii) Br(CFI0,OH, BF,.Et,0. DCM. 70% for 4i, 67% for 4], 53% for 4k: (iii) NaOMe. 

MeOR 96% for 4d, 92% for 4e. 90% for 4f: (iv) NaSSO.CFI,. DMF, 50° C, 80% for Id. 
71% for le. 68% for If. 88% for li. 83% for Ij. 88% for Ik)). Two types of 
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glycosylating reagents, the anoineric methanethiosulfonate la and the ethyl-tethered 
methanethiosulfonates lb,c,g,h, vvere prepared from D-glucose (2a, Figure 6). The 
preparation of these reagents in fully protected la,g,h and deprotected lb,c forms allowed 
the effects of increased steric bulk and hydrophobicity to be assessed. Untethered MTS 
5 reagent la was readily prepared from acetobromoglucose (3) using NaSSO.CH. as shown 
in Figure 6 (Prepared from D-glucose according to Scheurer et al.. J. Am, Chem. Soc . 
76:3224 (1954), which is hereby incorporated by reference). For the preparation of lb,g, 
an a-linked ethyl tether was introduced using Fischer glycosidalion of D-glucose (2) with 
2-bromoethanol. Treatment of the tetraacetylbromide 4g with NaSSO^CH;; allowed the 

10 preparation of the peracetylated a-gluco-MTS Ig in an excellent 90% yield. Zemplen 
deacylation (Zemplen et al., Ber. Dtsch. Chem. Ges. . 56:1705-1710 (1923), which is 
hereby incorporated by reference) of bromide 4g and subsequent displacement of bromide 
by methanethiosulfonate ion proceeded smoothly to yield the fully deprotected a-gluco- 
MTS lb in 69% yield. The P-D-gluco-MTS reagents Ic and Ih, which are epimeric at C- 

15 1 relative to lb and Ig, respectively, were prepared from the corresponding peracetylated 
P-bromide 4h. The preparation of 4h took advantage of well-defmed methodology 
utilizing Lewis acid catalyzed displacement of anomeric acetates by alcohols (Dahmen et 
aL, "2-Bromoethyl Glycosides - Synthesis and Characterization," Carbohvdr. Res. , 
1 16:303-307 (1983), which is hereby incorporated by reference). The protected bromide 

20 4h was elaborated to the corresponding peracetylated (Ih) and deprotected (Ic) p-gluco- 
MTS reagents in an essentially identical manner to that used for the epimeric a-gluco- 
MTS reagents. Thus, using NaSS02CH3, 4h gave Ih in 78% yield and, following 
deprotection, 4c (Helferich et al., Just. Lieb. Ann. Chem. , 541:1-16 (1939), which is 
hereby incorporated by reference) afforded Ic in 68% yield from 4h. Parallel routes 

25 allowed similarly efficient access to the a-D-manno-MTS reagents Id and li, which are 
epimeric at C-2 relative to lb and Ig, respectively, and the p-D-galacto-MTS reagents le 
and Ij, epimeric at C-4 relative to Ic and Ih, respectively (Figure 7). The ready 
adaptability of this method to oligosaccharides was illustrated by the preparation of the 
peracetylated (Ik) and fully deprotected (If) disaccharide lacto-MTS reagents, in good 

30 overall yields from lactose (2f) of 38% and 27% respectively without cleavage of the 
interresidue bond. 
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Examnle 9 - Site Specific Glycosylation. 

The glyco-MTS reagents la-k (See Figure 8) were reacted with the chosen 
cysteine mutants SBL-N62C. -S 1 56C, -S 1 66C and -L2 1 7C in aqueous buffer under 

5 conditions described previously (Stabile et al., Biooro. Med. Chem. Lett. , 6:2501-25 12 
(1996); Berglund et aL, J. Am. Chem. Soc . 1 19:5265-5266 (1997); DeSantis et al.. 
Biochem. , 37:5968-5973 (1998), which are hereby incorporated by reference). These 
reactions were rapid and quantitative, as judged by monitoring of changes in specific 
activity and by titration of free thiols with EUman's reagent (Ellman et al., Biochem. 

10 PhanmacoL , 7:88-95 (1961), which is hereby incorporated by reference). The 

glycosylated chemically modified mutants (CMMs) were purified by size-exclusion 
chromatography and dialysis, and their structures were confirmed by rigorous ES-MS 
analyses (± 7 Da) as shown in Table 3 below: 
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Table 3. 



Properties and Kinetic Parameters" of Modi tied Enzymes 



Enir> Reactanl 


Pocket 


MTS 


Reacn. pH 


Product(s) 




(mM) 




Enzyme 




Reagent 










(,<"'mM-') 


1 SBL-VVT 


. 


. 






15314 


0.73 - 0.05 


209 = i 5 


: N62C 










174 ±9 


i.90 ± 0.20 


92 ± 1 1 






la 


9.5 


N62C-S-a' 


67.9 ± 3.5 


0.52'u: 0.07 


130.6 X 18.S 






lb 


6.5 


N62C-.S-b'' 


135.3 ± 3.5 


0.94 ± 0.05 


143.9 = 8.5 


> 




Ic 


6 5 


N62C-S-C'' 


132.7 i 4.0 


1-25 ± 0.08 


106.2 = 7-5 


6 




Id 


6.5 


N62C-S-d** 


132.9±3.l 


1 .04 ± 0.05 


127.8 ± 6-8 


7 




le 


6.5 


N62C-S-e'' 


1 19.3 ± 3.6 


0.99 ± 0.07 


1 20.5 ± 9.3 


8 




If 


6.5 


N62C-S'r' 


129.8 ± 2.4 


1 .04 ± 0.04 


1 24.8 ± 5.3 


9 




Ig 


5.5 


N62C-S-g*= 


120.0 - 2.7 


0.52 ± 0.03 


230.8 = 14.3 


10 




Ih 


6.5 


N62C-S-Et-p-Glc(Ac)." 


87.7 ± 4.2 


1 .63± 0. 1 5 


53.8 ± 5.8 


i ) 




1 h 


5.5 


N62C-S-Ei-p-Glc(Ac>..^ 


100.3 ± 3.5 


t 86 ± 0.1 2 


53.9 X 4.0 


12 




li 


5.5 


N62C-S-i'* 


123. 0± 1.6 


1 .05 ± 0.03 


! 1 7.1 ±3.7 


i3 




Ij 


5.5 


N62C-S-Et-(J-Gal(Ac)-/ 


103.4 ± 4.3 


2.36 ± 0.17 


43.8 ± 3.6 


14 




Ik 


5.5 


N62C-S-k'' 


64 .9 ± 1.5 


0.88 ± 0.05 


73.8 ± 4.5 


15 L2 1 7C 


S.' 








4 1 ± 1 




5 I ± 3 


16 




la 


9.5 


L2 1 7C-S-fl-Cjlc'' 


27.7 ± 0.4 


0 79 ± 0 fi 1 


35. 1 ± 1 .4 


1 7 




fa 


7.5 




44.9 ± 2.0 


0 44 -fc 0 Oft 


1 0"? 0 rfc I 4 


IS 




la 






36.3 ± 0.8 




I 00 R ± R 7 


19 




1 b 


6.5 


L21 7C-S-b'* 


■^78^06 




86 3 ± 2 7 


20 




Ic 


6.5 




50.6 ± 0.9 


0 ft7 + 0 0^ 


75.5 ± 3.6 


21 




Id 


6.5 


L21 7C-S-d*' 


62.0 ± 1 .3 




1 1 2-7 ±66 


22 




le 


6.5 


L2 1 7C-S-e*' 


46.2 ± 0.8 




73.3 ± 3.7 


23 




If 


6.5 


L217C-S-f*' 


30.4 ± 0.6 




66. 1 ± 4.5 


24 




Ig 


5.5 


L2 i 7C-S-Etra-Glc( Ac)>' 


72.7 ± 3.1 


0.73 ± 0.08 


99.6 ± 11.8 


25 




Ih 


5-5 


L2 1 7C-S-Et-p-Glc(Ac)-/ 


29.4 ± 0.8 


0.93 ± 0.06 


31.6 ± 2.2 


26 




li 


5.5 


L2 ! VC-S-El-a-ManCAc);*" 


97.8 ± 2.4 


0.59 ± 0.04 


165.8 ± 12.0 


27 




H 
'J 


5.5 


L2 1 7C-S-Ei-p-Gal( Ac)-/ 


39.2 ± 0-8 


! . 1 7 ± 0-05 


33.5 ± 1.6 


28 




Ik 


5.5 


L2 1 7C-S-Et-Lac(AcV,<^ 


27.1 ± 0.6 


0.69 ± 0.04 


39.3 ± 2.4 


29 S I 56C 


s. 








1 25 ± 4 


0 S5 + 0 06 


147+ [ 1 


30 




la 


9.5 


S 1 56C-S-a'' 


54.8 ± 1 .3 


0.70 ± 0.04 


78.3 ± 4.8 


3 1 




lb 


6.5 


S 1 56C-S-b'' 


77.0 ± 1.2 


0.84 ± 0.03 


91.7 ± 3-6 


32 




Ic 


6.5 


.SI56C-S-C*' 


76.6 ± 1.7 


0.73 ± 0.04 


104.9 ± 6.2 


33 




Id 


6.5 


SI56C-S-d'' 


88.6 i 2.8 


0.79±0.06 


112.2 ±9.2 


34 




le 


6.5 


SI56C-SV 


78.9 i 1.9 


0.89 ± 0.04 


89.7 ±4.4 


35 




If 


6.5 


SI56C-S-f*' 


63.6 ± 1.4 


0.89 ± 0.05 


71.8 ±4.3 


36 




Ig 


5.5 


Sl56C-S-g'' 


43.6 ± 0.8 


0.78 ±0.04 


55.9x3.0 


37 




Ih 


5.5 


51560-5^ 


64.0 ± 1.3 


0.72 ±0.04 


88.9 ± 5.2 


38 




li 


5.5 


S 1 56C-S-i* 


60.3 ± 0.9 


0.71 ±0.03 


84 .9 ±3.8 


39 




tj 


5.5 


S 1 560-5-]" 


51.9 ±0.6 


0.61 ±0.02 


85.1 ± 3.0 


4 n 




1 k 


5-5 






A TOj-A AI 


Of A ± l.Ji 


41 SI66C 


s, 








42 ± 1 


0.50 ± 0.05 


84 ±9 


42 




la 


9.5 


SI66C-S-a*' 


33.8 ± 1.3 


0.66 ± 0.06 


51.2 ±5-0 


43 




lb 


6.5 


Sl66C-S-b'' 


8I.9± 1.1 


1.14 ±0.03 


71.8±2.1 


44 




tc 


6.5 


SI66G-S-C'' 


67.0 ±2.2 


0.99 ± 0.07 


67.6 ±5.3 


45 




Id 


6.5 


SI66C-S-d'» 


76.5 ± 2.0 


I.17±0.07 


65.4 ± 4.3 


46 




le 


6.5 


SI66C-S-C'* 


62.2 ± 1 .4 


1.08 ±0.05 


57.6 ±3.0 


47 




If 


6.5 


SI66C-S-f*' 


58.2 ± 1.2 


1.02 ±0.04 


57.1±2.5 


48 




Ig 


5.5 


SI66C-S-Et-a-Glc(Ac),* 


3I.0±0.8 


0.77 ±0.05 


40.3 ± 2.8 


49 




Ih 


6.5 


Si66C-S-Et-3-Gic(Ach^ 


95.0 ±2.1 


0.87 ±0.05 


109.2 ±6.7 










SI66C-S-Et-p-Glc(Ac)-.'' 








50 




Ih 


5.5 


SI66C-S.E(-P-Glc(Ac)/ 


72.9 ± 1.7 


0.65 ± 0.04 


I 12.2 ± 7.4 










S166C-S-h^ 








51 




li 


5.5 


S 1 66C-S-Ei-a-Man( Ac);' 


67.7 ± 1.9 


1.64 ±0.09 


41.3 ±2.5 


52 




<j 


5.5 


SI66C-S-Et-P-Gal(Ac)-'* 


65.1 ±0.9 


0.80 ±0.03 


8I.3±3.3 


53 




Ik 


5.5 


S166C-S-Et-Lac(Ac)5*' 


67.4 ± 1.6 


1.65 ±0.07 


40.8 ±2.0 



Michaelis-Menten constarits were measured at 25 °C according lo the initial rates method in 0.1 M Tris-HCl buffer at pH 8.6. 
0.005% Twccn 80, 1% DMSO, suc-AAPF-pNA as the substrate. 
5 Single species. 

' Single product mass by ES-MS. 
*" Major component. 
* Minor component. 
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The CMMs each appeared as a single band on non-denaturing gradient PAGE, thereby 
estabHshing their high purities. The active enzyme concentration of the resuhing CMM 
solutions was determined by active site titration with a-ioluenesulfonyl fluoride (PMSF) 
5 using a fluoride ion-sensitive electrode (Hsia et al.. "Active-Site Titration of Serine 
Proteases Using a Fluoride-Ion Selective Electrode and Sulfonyl Fluoride Inhibitors." 
Anal. Biochem. . 242:221-227 (1996), which is hereby incorporated by reference). In all 
cases, modification with the fully deprotected reagents Ib-f (See Figure 8) led to site- 
specific glycosyiations and the formation of single glycoforms. Furthermore, 

10 modification with the protected MTS reagents la, g-k (See Figure 8) gave products with 
controllable levels of acetylation. Through adjustment of pH and appropriate selection of 
the glycosylation site, differently acetylated glycoforms of SBL were prepared. This 
ability to modulate the level of acetylation through pH-control vastly expands the 
structural variety of glyco-CMMs that can be conveniently accessed and its scope was 

15 probed through the reaction of la (See Figure 8) v/ith SBL-N62C, -S156C, -S166C, - 
L217C (Figure 9). 

The extent of deacetylation during modification was highly site-dependent. 
Modification of L217C with reagent la (See Figure 8) at pH 9.5 was accompanied by 
complete in situ deacetylation, and the sole product was the fully deprotected 

20 glucosylated-SBL, L217C-S-P-Glc. In contrast, treatment of N62C, S156C, and S166C 
with la (See Figure 8) at pH 9.5 yielded only fully acetylated products, N62C-S-a (See 
Figure 8), S156C-S-a (See Figure 8), and S166C-S-a (See Figure 8), respectively. To 
examine the effects of pH upon deacetylation, the reaction of L217C with la (See Figure 
8) was chosen. At pH 7.5 and 5.5, the products retained two and three acetate groups, 

25 forming L217C-S-p-Glc(Ac)2 and L217C-S-P-Glc(Ac)3, respectively. In all cases, 
complete integrity of the site selectivity was retained. 

This valuable site-dependent deacetylation was attributed to a novel 
intramolecular SBL-catalyzed process. Although acetate esters are moderately 
chemically labile in aqueous solution at pH 9.5, they are not at either pH 7.5 or 5.5 

30 (Greene et al.. Protective Groups in Organic Synthesis . 2ncl ed. New York, Wiley ( 1 99 1 ), 
which is hereby incorporated by reference). The striking differences in behavior during 
modification between L217C and the three other mutants N62C, S156C, and S166C 
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under identical reaction conditions discounted the possibility of deacetylation prior to 
modification. In addition, it was noted thai position 217 bears an internally-oriented side 
chain and that modificaiion of surface exposed position 156 showed no sign of 
deacetylation. This observation discounted both the possibility of either in siiu chemical 
5 deacetylation or iniermolecular enzymatic deacetylation. Furthermore, this ability of SBL 
to intramolecularly deacetylate was confirmed by the reaction of L217C-S-P-Glc(Ac); at 
pH 9.5. Incubation of L21 7C-S-P-Glc(Ac)3 under standard modification reaction 
conditions, but without reagent la (See Figure 8), gave L217C-S-P-Glc as the sole 
product (Figure 9). 

10 The enormous potential of this method was demonstrated by the 

preparation of a small library of differently acetylaied glycosylated CMMs through the 
reaction of SBL-N62C -S156C, -S166C, and -L217C with MTS reagents Ig-k (See 
Figure 8). Using the pH-activity profiles of wild-type ("WT") and CMMs of SBL as a 
guide. pH 5.5 and 6.5 were chosen to minimize deacetylation. -Typically, the specific 

15 activity of SBL and its CMMs drops sharply below pH 7.5 to levels that at pH 5.5 are 5- 
20% those at optimal pH (8.5-9.5) (Desantis et al., "Chemical Modifications at a Single 
Site Can Induce Significant Shifts in the pH Profiles of a Serine Protease," J. Am. Chem. 
Soc., 120:8582-8586 (1998), which is hereby incorporated by reference). As expected, 
this drop in hydrolytic activity was reflected in the products of these modifications with 

20 Ig-k (See Figure 8), which in all cases retained two or more acetate groups. 

For example, at pH 5.5 reactions of SBL-L21 7C and -SI 66C created 
singly deacetylated CMMs, with the exception of pure dideacetylated glyco-CMMs 
L21 7C-S-Et-a-Glc(Ac)2, and S166C-S-Lac(Ac)5. The formation of the latter may reflect 
the presence of two primary acetates in disaccharidic MTS reagent Ik (See Figure 8). 

25 Primary acetate groups are typically more labile than secondary acetate groups under 
conditions of intermolecular enzymatic deacetylation (Bashir et al., "Enzymatic 
Esterification and De-Esterification of Carbohydrates - Synthesis of a Naturally- 
Occurring Riiamnopyranoside of P-Hydroxybenzaldehyde and a Systematic Investigation 
of Lipase-Catalyzed Acylation of Selected Arylpyranosides," J. Chem. Soc, Perkin 

30 Trans. L 2203-2222 (1995), which is hereby incorporated by reference). In contrast, 

reactions of SBL-S156C gave only the fully acetylated CMMs, S156C-S-g-k (See Figure 
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8). This uniform lack of deacetylation observed for surface exposed glycans at position 
156 was consistent with an intramolecular enzyme-catalyzed mechanism requiring 
internally-oriented acetate groups. Interestingly, the reactions of SBL-N62C were also 
determined by the anomeric configuration of Ig-k (See Figure 8) with a-MTS reagents 
5 lg,i (See Figure 8) giving products that retained all acetate groups while p-MTS reagents 
Ih.j.k (See Figure 8) were monodeacetylated. The range of accessible acetylated glyco- 
CMMs was further extended tlirough modification at pH 6.5. For example, at position 62, 
it allowed the introduction of diacetylated p-glucose, forming N62C-S-Et-P-Glc(Ac)3, in 
place of the triacetylated N62C-S-Et-P-Glc(Ac)3 formed at pH 5.5. The range of 
10 acetylation at position 166 was similarly expanded through the formation S166C-S-Et-p- 
Glc(Ac), in place of S166C-S-Et-p-Glc(Ac)v 

Example 10 - Glycan Structure-Hydrolytic Activity Relationships 

15 The effects of glycosylation upon SBL were assessed by the determination 

of and K^.^ for the hydrolysis of succinyl-AAPF-p-nitroanilide (Suc-AAPF-pNA) at 
pH 8.6. The kinetic parameters of the 48 CMMs generated were compared with those of 
WT and unmodified mutants in Table 3. The excellently selective and controlled method 
shown in Figure 8 allowed the introduction of structurally related monosaccharides, D- 

20 glucose, D-galactose, and D-mannose, in addition to the more sterically bulky 

disaccharide lactose. From the resulting glycosylated CMMs, a detailed and precise set of 
'Structure-activity relationships was generated (Figures 10-12). 

At position 62, in the pocket, the 2,3-fold reduction in KJK^ caused by 
mutation to cysteine was partially restored by glycosylation (Figure lOA). The 

25 introduction of ethyl-tethered a- or P-glucose, P-galactose or a-mannose to N62C 

increased kJK^, and formed N62C-S-b-e (See Figure 8) with kJK,.^ 1 .5- to 2-fold lower 
than WT. Despite its steric bulk and high hydrophilicity, disaccharidic lacto-CMM 
N62C-S-f (See Figure 8) also showed higher activity than N62C with kJK^^oviXy 1.7-foId 
lower than WT. 

30 The effects of the mutation of position 217 in the S,' pocket were 

intrinsically more dramatic as indicated by a value of KJKf,f for L21 7C that is 4-fold 
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lower than WT (Figure I OB). The introduction of deprotecied untethered glucose, 
fomiing L2 1 7C-S-p-Glc, lowered k^.JK^u further to 6-fold lower than WT. In contrast, 
glycosylation of position 217 with ethyl tethered N4TS reagents Ib-f (See Figure 8) 
restored activity and k^JK^^s for L21 7C-S-b-f (See Figure 8) were similar to each other in 
5 the range 2.5- to 3.1 -fold lower than WT. This striking difference between tethered 

L217C-S-b-f (See Figure 8) and untethered L2 1 7C-S-p-Glc illustrated that SBL tolerates 
the replacement of hydrophobic Leu with highly hydrophilic carbohydrate moieties when 
they are linked by a hydrophobic ethyl spacer group better than directly-linked Cys-S-P- 
Glc. This may indicate that a structural requirement for efficient amidase activity is a 
10 closely-bound hydrophobic residue in the subsite of SBL and contrasts sharply with 
the excellent enhancement of esterase activity caused by the same Cys-S-P-Glc 
substitution. 

Mutation of position 156 in the S, pocket to cysteine caused a L4-fold 
drop in /r.^/AT^/ (Figure IOC). Subsequent introduction of deprotected S-Et-a-Glc, side 

15 chain b (See Figure 8), resulted in a KJKf^f for S 1 56C-S-b (See Figure 8) that was 2.3- 
fold lower than WT. From SI 56C-S-b to -f (See Figure 8), k^JK^^iS varied in an arced 
manner peaking at 1 .9-fold lower than WT for S 156C-S-d (See Figure 8) and then 
decreasing monotonically to a k^JKj^ for S156C-S-f (See Figure 8) that was 3-fold lower 
than WT. The similar A'^^ values for these S 1 56 CMMs to those of SBL-WT were 

20 indicative of these modifications having little effect upon ground state binding and were 
consistent with the surface exposed orientation of the SI 56 side chain. 

At position 166, in the S, pocket, the 2.5-fold decrease in /:„yA^/ caused by 
mutation to cysteine was amplified by modification with lb (See Figure 8) and led to a 
Ka/K^ value 3-fold lower than WT for S166C-S-b (See Figures 8 and lOD). From 

25 S 1 66C-S-b to -f (See Figure 8), k^JKj,,f decreased monotonically to a k^JK^,, for S 1 66C-S- 
f (See Figure 8), in which the S, binding site was occupied by the sterically bulky 
disaccharide lactose, that was 3.8-fold lower than WT. 
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Example 11 - Kinetic Effects of Glycosylation with Acetylated Carbohydrates. 

The enormous potential of the controlled site-selective glycosylation 
approach depicted in Figure 8 was illustrated by the great variety of changes in k^JK^^jXhai 
were caused by the introduction of acetylated side chains a,g-k (See Figure 8) to SBL. 

5 These dramatic changes contrast with the slight variations found for deprotected side 
chains b-f (See Figure 8). For example, at position 62, an alternating decrease-increase 
pattern was observed (Figure 1 1 A). This resulted in a kJK,, for tetraacetylated a-gluco- 
CMK4 N62C-S-g (See Figure 8) that was 1.1 -fold higher than WT. Similar alternating 
patterns were also seen at positions 217 (Figure 1 IB) and 166 (Figure 1 ID). At position 

10 1 56, variations were slight, which was consistent with its surface exposed orientation 
(Figure 1 1 C). 

To examine the cause of these variations, the k^JK^fS of acetylated 
glycosylated CMMs were compared with those for deprotected glycosylated CMMs with 
the same glycan structure and stereochemistry (Figure 12). This separated the effects of 

15 acetylation from the effects of glycosylation and allowed the underlying effects of 
modification to be dissected. It was clear from Figure 1 2 that the anomeric 
stereochemistry of the acetylated glycans modulates KJK^- 

For example, at position 62 (Figure 12 A) comparison of N62C-S-b,c (See 
Figure 8) with N62C-S-g (See Figure 8) and N62C-S-Et-p-Glc(Ac)2.3 showed that 

20 increasing the number of acetate groups from zero to four, from N62C-S-b (See Figure 8) 
to N62C-S-d (See Figure 8), increased kJK^,f 1 .6-fold for the a-gluco side-chain b (See 
Figure 8). In contrast, increasing the number from zero to two or three, from N62C-S-C to 
N62C-S-Et-P-Glc(Ac)2o,3, was detrimental for the P-gluco side-chain c (See Figure 8) and 
led to a 2-fold decrease. Similarly, N62C-S-Et-p-Gal(Ac)3 displayed a distinctly lower 

25 kJK^, than N62C-S-e (See Figure 8) that was 5-fold lower than WT. These changes in 
KJK^^f upon acetylation were manifested largely through increased or decreased ground 
state binding, of which the most striking example was a AT^/ for N62C-S-Et-p-Gal(Ac)3 
that was 2.4-fold higher than deprotected galacto-CMM N62C-S-e (See Figure 8). 

At position 2 1 7, control of the level of acetylation through pH, as shown m 

30 Figure 9, had allowed the introduction at position 217 of untethered p-D-glucose bearing 
zero, two, and three acetate groups. As Figure 12B illustrates, the addition of two or three 
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acetate groups restored k^.JK^^f from 6-fold lower than WT for L21 7C-S-p-Glc to 2-fold 
lower than WT for L2 1 7C-S-(3-Glc(Ac), or L2 1 7C-S-P-Glc(Ac),. This showed that 
acetylation allowed fine-tuning of activity and paralleled increases in the esterase k^ JK^^s 
of these CMMs. 

5 The same trend in k^ JK^, was observed for the L21 7C ethyl tethered 

CMMs as at position 62: acetylation was beneficial to a-tethered but detrimental to P- 
tethered CMMs. For example, increasing the number of acetate groups in the a-linked 
glucose moiety from zero to two, i.e., from L21 7C-S-b (See Figure 8) to L217C-S-Et-a- 
Glc(Ac)25 increased k^ JK^j to 2-fold lower than WT. In contrast, increasing the number of 

10 acetate groups in the epimeric P-linked moiety from zero to three, i.e., from L217C-S-C 
(See Figure 8) to L21 7C-S-Et-P-Glc(Ac)3, halved kJK^, to 6-fold lower than WT. 
Similarly, the k^JKf,^ of a-linked L21 7C-S-Et-a-Man(Ac)3 was 1 .5-fold higher than the 
corresponding deprotected L217C-S-d (See Figure 8), while P-linked L217C-S-Et-p- 
Gal(Ac)3 was 2-fold lower than the corresponding deprotected L217C-S-e (See Figure 8). 

15 Consistent with its surface exposed orientation, the changes at position 1 56 

caused by acetylation were slight and no variation with anomeric stereochemistry was 
seen (Figure 12C). Increasing the number of acetate groups from zero to four, from 
S156C-S-b-e (See Figure 8) to S156C-S-g-j (See Figure 8), decreased kJK^^hy\,Q5- to 
1.6-fold. The most sterically bulky lactose side chain (-k) (See Figure 8) gave rise to the 

20 lowest KJK^i at position. 1 56, 3.2-fold lower than WT, and indicated that even at the 
surface of SBL the introduction of sterically bulky groups still allowed tailoring of 

At position 166, the effects of increased acetylation were, as at positions 
62 and 217, modulated by glycan anomeric configuration (Figure 12D). However, the 

25 direction of these increases and decreases was reversed: acetylation was beneficial to P- 
tethered but detrimental to a-tethered CMMs. For example, the a-tethered S166C-S-Et- 
a-Glc(Ac)3 had a 1.8-fold lower A^^/A^Af value than the corresponding deprotected S166C- 
S-b (See Figure 8), while p-linked CMMs S166-S-Et-P-Glc(Ac)2 3 had 1.6-fold higher 
/r^y^A^ values than the corresponding fully deprotected S166-S-C (See Figure 8). Again, 

30 these variations were largely manifested through changes in ground state binding. For 
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example, K,, increased 1.4-fold from SI 66C-S-Et-a-Man (-d) (See Figure 8) lo S166C-S- 
Et-a-Man(Ac)v 

It should be noted that the changes in activity of the lacto-CMMs upon 
acetylation fell largely outside of these trends and at all four positions acetylation of the 
5 bulky, disaccharidic side-chain f (See Figure 8) caused a general decrease in k^JK.fS. For 
example, at position 62 heptaacetylation. from N62C-S-f (See Figure 8) to N62C-S-k 
(See Figure 8), resulted in a lowering of A;,,//:.,/ to a value that is 3-fold lower than WT. 
Despite the greater steric bulk of side chain k (See Figure 8). this drop was a consequence 
of a lower A,,,. 2-fold lower than N62C-f (See Figure 8), rather than a higher K,,. In fact. 
10 the a:,; value ofN62C-S-k (See Figure 8) was 1 .3-fold lower than N62C-S-f (See Figure 
8). Similarly, L217C-S-Et-Lac(Ac), and S166C-S-Et-Lac(Ac)5 had 1.7-fold and L4-fold 
lovs'er k^JKj^fS than the corresponding deprotected CMMs, respectively. 

In summary, the strategy of site-directed mutagenesis combined with 
chemical modification was exploited for the site-selective glycosylation of SBL. This 
15 method was general, versatile and allowed the preparation of pure glycoforms which 
constitute the first examples of regio- and glycan- specific protein glycosylation at 
predetermined sites. Careful control of a novel SBL-catalyzed intramolecular 
deacetyiation greatly expanded the scope of this method and through reaction of SBL- 
N62C, -S156C, -S166C, and -L217C with peracetylated MTS reagents la,g-k (See Figure 
20 8) allowed the introduction of glycans with precisely modulated levels of acetylation. 

The glycosylated CMMs formed display kJKj,, values that ranged from 
1 .1-fold higher than WT to 7-fold lower than WT. Without the use of this highly 
selective glycosylation technique, the determination of such precise trends would be 
unachievable and variations caused by previous non-specific glycosylation could only be 
25 interpreted in a general manner. It has been demonstrated that subtle differences in 
carbohydrate structure may be used to fine tune the activity of SBL. For example, the 
anomeric stereochemistry of the glycans introduced modulated changes in k^JK^^f upon 
acetylation. At positions 62 and 217, acetylation enhanced the activity of a-tethered 
CMMs but decreased that of p-tethered. This trend was reversed at position 166 where, in 
30 contrast, acetylation enhanced the k,JK^.^ of (i-tethered CMMs but decreased those of a- 
tethered. Consistent with its surface exposed nature, changes at position 156 were more 
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modesi, but still allowed control of activity particularly through glycosylation with 
disaccharide lactose. These results illustrated the great potential for tailoring activity 
through the correct choice of glycan and glycosylation site. 

The ability of the glycosylation method to glycosylate the binding pockets 
5 of SBL also creates opportunities to broaden its substrate specificity. For instance, an 

array of hydrogen bonding hydroxyl groups may broaden its specificity towards hydrogen 
bonding substrates such as glycosylated amino acids. Subtilisins have been elegantly 
used to catalyze the synthesis of glycopeptides (Witte et aL, ''Solution- and Solid-Phase 
Synthesis of N-protected Glycopeptide Esters of the Benzyl Type as Substrates for 

1 0 Subtilisin-Catalyzed Glycopeptide Couplings/' J. Am. Chem. Soc. 1 20: 1 979- 1 989 

(1998); Wong et al., "Enzymatic-Synthesis of N-Linked and O-Linked Glycopeptides," T 
Am. Chem. Soc. 1 15:5893-5901 (1993), which are hereby incorporated by reference). 
However, the natural specificity of these enzymes has limited these peptide ligations to 
those in which the glycosylated residues are at least one residue distant (P,,?.. .. or 

15 P^'^P/...) from the amide bond formed. For example, while ligation of Z-Gly-OBz with 
H-Gly-Ser(Ac3GlcNAcp)-NH2 was successful, no yield of product was obtained with H- 
Ser(Ac3GlcNAcP)-NH, (Witte et aL, J. Am. Chem. Soc . 120:1979-1989 (1998), which is 
hereby incorporated by reference). The introduction of sugars to the Sj and S,' subsites as 
hydrogen bonding groups demonstrated here may enhance the specificity of proteases 

20 towards hydrophilic substrates. 

Furthermore, by choosing carbohydrate attachments that differ from each 
other at only one stereocenter, SARs may be determined by examining changes in activity 
as the nature of sugar side chain is varied. For example, the effect of inverting 
stereocenters in the order C-4^C-l^C-2 can be determined using CMMs in the series e 

25 -> c-> b-> d (See Figure 8). While the current illustrations have been with SBL as a 

protein example, the method is clearly amenable to the glycosylation of any protein and is 
without limitation with respect to the sites and to the glycans that may be conjugated. It 
will, therefore, allow the introduction of any therapeutically important carbohydrate 
recognition determinant, of which the P-D-galactopyranosyl moiety of e and f (See Figure 

30 8) that represents a ligand of the hepatic asialoglycoprotein receptor (Sharon et al.. Essays 
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Biochem. , 30:59-75 (1995), which is hereby incorporated by reference) is just one 
example. 

Example 12 - Esterase Screen 

5 Specificity constants determined using the low substrate approximation 

were measured indirectly using Ellman's reagent (Ellman et al., Biochem. Pharmacol. . 
7:88-95 (1961). which is hereby incorporated by reference) (£412 = 13600 M 'cm ') using 
0.15 and 0.30 mM succinyl-AAPF-SBn as substrate in 0.1 M Tris.HCL containing 0.005 
vol% DMSO, 1 vol% 37.5 mM Ellman's reagent in DMSO, pH 8.6. 

10 

Example 13 — Full Esterase Kinetics Measurements 

Michaelis-Menten constants were measured at 25 °C by curve fitting 
(Grafit'^' 3.03, Erithacus Software Ltd., Staines, Middlesex, UK) of the initial rate data 
determined at eight concentrations (31.25 |LiM - 3 mM) of the succinyl-AAPF-SBn 
15 substrate, followed indirectly using Eliman's reagent in 0.1 M Tris.HCl, containing 0.005 
vol% DMSO, 1 vol% 37.5 mM Ellman's reagent in DMSO, pH 8.6. 

Example 14 - Esterase Activity Screen 

The glyco-CMMs shown in Table 4 were prepared, by reacting reagents 
20 la-k (See Figure 8) with the chosen cysteine mutants SBL-N62C, -S156C, -S166C and - 
L217C, purified and extensively characterized as described previously. 
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Table 4. 
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'Kinetic constants determined in duplicate using the low substrate concentration approximation in 0. 1 M Tris buffer, 
pH 8.6. 0.005% Tvveen 80. 1% DMSO with suc-AAPF-SBn as substrate. [S] ^ 15 or 30 ^M. [E] = 4.8 x 10 " to 6.0 



5 xl0-"'M. 

E/A = KK JK^)^^ f {kJK^,)^^. 
' For SBL-WT: ;t,^//r^ = 3592.5 mM-'s*'. E/A = 1 7. 



The kinetic parameters of esterase activity were determined at pH 8.6 by indirectly 
10 following the release of thiobenzyl alcohol from the substrate succinyl-Ala-Ala-Pro-Phe- 
SBn (suc-AAPF-SBn) with Ellman's reagent (Ellman et al., Biochem. Pharmacol. . 7:88- 
95 (1961), which is hereby incorporated by reference). To allow a rapid screen of 
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esterase activity, a low substrate concentration ([S] « K^,) was used that allowed k^JK_^i 
to be determined directly from the initial rate of reaction. The results from the screen are 
shown in Table 4. 

Modification at position 62, in the S, pocket, with deprotected sugar 

5 reagents Ib-f (See Figure 8) increased k^JK^^,. resulting in a series of five enzymes that 
had similar kJK,,s that were 1 .3- to 1 .9-fold greater than WT (Figure 13 A). The 
presence of an a-linkage w^as clearly deleterious to activity, as N62C-SEtpGlc (-c) (See 
Figure 8) had a k,JK,f 1.2-fold greater than its epimer N62C-SEtaGlc (-b) (See Figure 8) 
and 1.9-fold greater than WT. Furthermore, the a-linked N62C-SEtaMan (-d) (See 

10 Figure 8) had the lowest kJK^^ in this group which was 1 .3-fold greater than WT. 

As at position 62, the introduction of any of the deprotected sugar side 
chains b-f (See Figure 8) at position 217, in the S,' pocket increased kJK,yf{V\gurQ 13B). 
However, the effects of glycosylation at this site were far more dramatic as demonstrated 
by a kJK^i for L217C-SEtpGal (-e) (See Figure 8) that was 3.4-fold greater than WT. By 

15 comparing the kJK,,s of L217C-SpGlc and L217C-SEtpGlc (-c) (See Figure 8), it was 
possible to gauge the effect on activity of introducing an ethyl tether at this position. This 
introduction increased kJK^.^, from 1.8-fold greater than WT for L217C-SpGlc to 2.7- 
fold greater than WT for L21 7C-SEtpGlc (-c) (See Figure 8). As at position 62, p-linked 
glyco-CMMs (-c,-e,-f) (See Figure 8) had higher kJK>,iS than the a-linked ones (-a,-d) 

20 (See Figure 8). For example, L21 7C-SEtpGlc (-c) (See Figure 8) had a kJK,, L3-fold 
greater than L217C-SEtaGlc (-b) (See Figure 8). 

Consistent with the surface exposed orientation of the SI 56 side chain, the 
S156C deprotected glyco-CMMs had similar Ka/K^.fS that were 1 .3- to 2.1 -fold lower than 
WT (Figure 13C). 

25 At position 166, in the S, pocket, mutation to cysteine resulted in an 

enzyme with a severely lowered kJK,^^ that was 10-fold lower than WT. However, 
subsequent modification with 1 b-f (See Figure 8) restored much of the catalytic activity 
(Figure 13D), and the S166C deprotected glyco-CMMs S166C-S-b-f (See Figure 8) had 
similar k^JK^^^, that varied from 1.1- to 1 .4-foId lower than WT. 
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Example 15 - Kinetic Effects of Glycosylation with Acet>'lated Carbohydrates. 

At position 62. in the S, pocket, in sharp contrast to the trend observed for 
the unacetylaied N62C CMMs. acetylated N62C CMMs had a wide range ofkJK.fi. 
Introduction of acetates increased the k^ JK,^ of the a-linked CMMs relative to the 
corresponding deprotected glyco-CMMs (Figure 14A). Thus, N62C-SEtaGlc(Ac), (-b) 
(See Figure 8) and N62C-SEtaMan(Ac), (-d) (See Figure 8) had k„JK,^ 1 .9- and 1 .6-foid 
greater than WT, respectively. Acetylation was clearly deleterious for p-linked CMMs. as 
N62C-SEtpGal(Ac),, N62C-SEtpGlc(Ac),, and N62C-SEtpGlc(Ac), all had k^JK,^ 
lower than WT. However, increasing the number of acetates present on the CMM 
restored activity: N62C-SEtpGlc(Ac), had a k^JK,, only 1.3-fold lower than WT and 1.5- 
fold higher than N62C-SEtpGlc(Ac),. In spite of their size, the sterically bulky side chain 
lactosylated N62C CMMs, N62C-SEtLac (-f) (See Figure 8) and N62C-SEtLac(Ac), (-k) 
(See Figure 8) had k,JK,^ that were similar to those of the CMMs derived from' 
monosaccharides. This provided a clear example of the versatility of the glycosylation 
method illustrated in Figure 8 and demonstrated that by using this method it was possible 
to introduce very large structures into the active site of SBL while maintaining catalytic 
competency. 

Modification with acetylated reagents la,g-k (See Figure 8) at position 
217, in the S," pocket, as with Ib-k (See Figure 8), led to CMMs with greater than WT 
KJK^fi (Figure 14B). For the untethered glyco-CMMs, increasing the number of acetates 
dramatically increased k,JK^ from 1.8-fold greater than WT for L217C-SpGlc to 2.4- 
fold greater than WT for L217C-SpGlc(Ac), and to 3.2-fold greater than WT for L217C- 
SpGlc(Ac)3 and mirrored the trend seen in amidase kinetics. For the ethyl linked L217C 
glyco-CMMs, the effect of acetylation was dependent on the anomeric stereochemistry, as 
observed for N62C glyco-CMMs and the L217C glyco-CMMs amidase kinetics. 
Acetylation of a-linked CMMs increased k,JK„ but decreased kJK^, for P-linked 
CMMs. This was most pronounced for L2 1 7C-SEtpGal(Ac)3 which had a k„JK„ only 
1 . 1 -fold greater than WT and 3. 1 -fold lower than L2 1 7C-SEtpGal (-e). In contrast to the 
effect oa deprotected L21 7C glyco-CMMs, the activity of acetylated L217C glyco-CMMs 
decreased upon the introduction of the ethyl linker. For example, L21 7C-SpGlc(Ac), had 
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a kJK„ 3.2-fold greater than WT, as compared with the 1 .4-fold greater than WT kJK,„ 
of L217C-SEtpGlc(Ac)v 

At position 1 56, in the S, poci^et, the SI 56C acetylated glyco-CMMs 
displayed httle difference in their kinetic constants from their unacetylated counterparts 

5 (Fieure 14C), an observation that was consistent with the surface exposed nature of the 
position 156 side chain. Introducing the ethyl linker led to an increase in kJK^, from 2.6- 
fold lower than WT for S156C-S[3Glc(Ac), (-a) (See Figure 8) to 1.5-fold lower than WT 
for S156C-SEtpGlc( Ac), (-h) (See Figure 8). 

In general, at position 166, in the S, pocket, the effect of acetylation on 

10 SI 66C ethyl linked glyco-CMMs was to reduce kJK^iS relative to their deprotected 

counterparts (Figure 14D). The exceptions were the ethyl linked P-gluco-CMMs, S166C- 
SEtpGlc(Ac), and S166C-SEtpGlc(Ac)5, which displayed kJK„s 1.4- and 1.7-fold 
greater than WT, respectively. These were the only two glyco-CMMs prepared at this site 
to show an enhancement in kJKi^i relative to WT, and this example illustrates that the 

15 correct selection of sugar is crucial to the tailoring of enzyme activity. In contrast to the 
effects observed at positions 62 and 217, an a-linkage to the sugar moiety was deleterious 
to the activity of the acetylated CMMs and S166C-SEtaGlc(Ac)3 had a kJK,, 1.9-fold 
lower than WT, in direct contrast to S166C-SEtpGlc(Ac)3. Introduction of the sterically 
bulky lactose moiety, in both acetylated and unacetylated forms, led to CMMs S166C- 

20 SEtLac (-f) (See Figure 8) and S 1 66C-SEtLac(Ac)5 with low kJK.fi that were 1 .4- and 
2.3-fold lower than WT, respectively. 

Example 16 - Full Esterase Kinetics. 

The three esterases with the highest kJK^ determined by the above 
25 screen, L2 1 7C-SpGlc(Ac)3, L2 1 7C-SEtaMan(Ac).„ and L2 1 7C-SEtpGal (-e) (See Figure 
8) had their individual k,„,s and K^,s determined by the initial rates method. The results 
are shown in Table 5. 
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Table 5. 



Full Esterase Kinetic Data for Glycosylated CMiMs' 



Enzyme 






(mM's') 


-WT 


E/A" 


WT 

L217C-SpGlc(Ac), 
L2l7C-SEtaManCAc)-, 
L217C-SetpGal 


1940.0±180 
4427.5±100.9 

3827.0±59.5 
4398.5±189.8 


0.54±0.07 
0.15±().01 
0.30±0.01 
0.36±0.04 


3592.5+572.7 
29516.7±2079.0 

I2756.7±469.2 
12218.1±1456.3 


1 

8.4 
3.6 
3.5 


17 
293 

77 
167 


' Kinetic constants determined by meihoc 


of initial rates in 0. 1 M Tris buf 


fer, pH 8.6, 



0.005% Tvveen 80, 1% DMSO with suc-AAPF-SBn as substrate. [S] = 30 to 2 mM 
[E]= 9.6 X 10 " to 1.1 X lO '^M. 
" E/A = (kJK,,) 

esterase ^ i^aJ amidase- 



The results were in good agreement with those determined by the screen 
for L217C-SEtaMan(Ac)3 and L217C-SEtpGal (-e) (See Figure 8) and confirmed the 
activity of these two enzymes to be 3,6- and 3.5- fold higher than WT, respectively. 
These increases in activity arose from both increased transition state stabilization, with 
kcar^ 2- and 2.3-foId greater than WT, respectively, and from greater substrate binding, 
with K^^ ] .8- and 1 .5-foId lower than WT, respectively. : 

Remarkable results were obtained for L217C-SpGlc(Ac)v This enzyme 
had a k^.^, 2.3-fold greater than WT and a A',/ 3-6-fold lower than WT, giving a k^JK,^ 8.4- 
fold greater than WT and some 2.5-fold greater than the value estimated by the screen. 
The difference in parameters obtained from the screen and the full kinetic analysis 
exposed a limitation of the low substrate screen. For the low substrate approximation to 
be accurate, the substrate concentration must be small compared to Kj,f. The Kj,^ of 
L217C-SpGlc(Ac)3 (0.15 mM) was evidently so small that the approximation did not hold 
in this case. This was the largest enhancement of activity relative to WT achieved using 
the combined site-directed mutagenesis and chemical modification strategy. 

Example 17 - Esterase Activity versus Amidase Activity. 

The differing effects of glycosylation upon amidase and esterase KJK^f 
can be compared in an informative manner using the (kJKJ^,^^ I {kJK^i)^,^^ ratio, 
E/A (see Figure 15). 
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All N62C deprolected glyco-CMMs had E/As that were enhanced relative 
lo WT. The increase in E/A was dependent on the presence of either an a- or a P-linkage 
with the p-linked CMMs N62C-SEtpGlc (-c) (See Figure 8). -SEtpGal (-e) (See Figure 
8). -SEtLac (-f) (See Figure 8) having higher E/As than the a-linked CMMs N62C- 
SEtaGic (-b) (See Figure 8) and -SEiaMan (-d) (See Figure 8) (Figure 15 A). These 
increased ratios were due to both increases in esterase kJK,,s and reductions in amidase 

As observed for the modifications made at position 62, glycosylation at 
position 217, in the S,' pocket, led to enzymes that had greatly increased E/A ratios 
relative to WT. Mutation to cysteine at position 217 increased E/A to 6.4-fold greater 
than WT. Modification with unacetylated P-linked sugars -SPGlc, -SEtpGlc (-c) (See 
Figure 8), -SEtpGal (-e) (See Figure 8), -SEtLac (-f) (See Figure 8) further increased E/A 
(Figure 15B). In contrast, a-linked glyco-CMMs had lower E/A values than that of the 
mutant. These changes in E/As were a result of parallel changes in esterase k,JK^,5 that 
were further amplified by opposing changes in amidase kJK:,fi. Introduction of an ethyl 
linker reduced the E/A from 10.9-fold greater than WT for L217C-SpGlc to 7.6-fold 
greater than WT for L217C-SEtpGlc (-c) (See Figure 8). 

At position 156, in the S, pocket, E/A ratios for unacetylated glyco-CMMs 
were all similar to each other in the range 1 .2- to 1 .6-fold greater than WT (Figure 15C). 

) The mutant S 1 66C had an exceptionally low E/A that was 4.2-fold lower 

than WT (Figure 15D) and this was largely a result of its very low esterase kJK^. 
- Because modification of S166C restored esterase kJK,^ to levels approaching that of 
WT and because S166C glyco-CMMs were poor amidases relative to WT, the net result 
was a family of CMMs with similar E/A ratios that were all enhanced relative to WT and 

5 significantly higher than the cysteine mutant (Figure 15D). 

Example 18 - Effects of Glycosylation with Acetylated Carbohydrates on E/A. 

At position 62, in the S, pocket, with the exception of N62C-SEtpGal(Ac)3 
and N62C-SEtaMan(Ac)4 (-i) (See Figure 8), acetylation of N62C glyco-CMMs led to a 
0 reduction in E/A. However, like their deprotected counterparts, the acetylated N62C 
glyco-CMMs all had larger E/As than WT (Figure 16A). Increasing the level of 
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acetylation increased the E/A ratio from 2.1 -fold greater than WT for N62C- 
SEtpG[c(Ac), to 3-0-fold greater than WT for N62C-SEtpGlc(Ac)v In spite of the 
general increases in E/A observed for the ethyl linked glyco-CK4Ms, the unteihered CMM 
N62C-SpGlc(Ac), (-a) (See Figure 8) had an E/A very similar to WT. 
5 At position 217, in the S,' pocket, L217C acetylated glyco CMMs all had 

enhanced E/As relative to Wl^ (Figure 16B). As for their deprotected counterparts, P- 
linked acetylated glyco-CMMs had higher E/As than that of the L217C mutant, whereas 
the E/As of the a-linked acetylated glyco-CMMs were lower than that of the mutant. In 
fact, modification at this site produced CMMs with by far the greatest enhancement of 

10 this ratio and the E/A for L2 1 7C-SpGlc(Ac)3 was 1 7.2-foId greater than WT. In contrast 
to the L217C deprotected glyco-CMMs, introducing an ethyl linker lowered E/A from 
17.2-fold greater than WT for L217C-SpGlc(Ac)3 to 9.2-fold greater than WT for L217C- 
SEtpGlc(Ac)3. For the untethered L217C acetylated glyco-CMMs, increasing the number 
of acetates also increased E/A, from 5.0-fold greater than WT for L217C-SEtpGlc(Ac)2 to 

15 1 7.2-fold greater than WT for L21 7C-SEtpGlc(Ac)3. 

At position 156, in the S, pocket, acetylation of a-linked glyco CMMs 
increased E/As (Figure 16C). Hence, S156C-SEtpGlc(Ac), (-h) (See Figure 8) had an 
E/A that was 1.5 fold greater than WT whereas 81 56C-SEtaGlc(Ac)4 (-g) (See Figure 8) 
had an E/A 2.0-fold greater than WT. S156C-SEtaMan(Ac)4 (-i) (See Figure 8) also had 
20 an E/A L3-fold greater than that of its unacetylated counterpart, S156C-SEtaMan (-d) 
(See Figure 8). 

At position 166, in the S, pocket, acetylation caused an increase in E/A for 
the glucosylated CMMs, irrespective of the anomeric stereochemistry (Figure 16D). 
Acetylation of all other sugar moieties led to a reduction in E/A. Increasing the number 
25 of acetates increased E/A from 2.7-fold greater than WT for S166C-SEtpGlc(Ac), to 3.2- 
fold greater than WT for S166C-SEtpGlc(Ac)3. 



Example 19 — Molecular Modeling 

The X-ray structure of sublilisin Bacillus lentus with the peptide inhibitor 
30 AAPF bound (Brookhaven database entry 1 JEA) was used as the starting point for 

calculations on wild type and CMMs. The enzyme setup was performed with Insight II, 
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version 2.3.0 (Biosym Technologies, Inc. San Diego, CA). To create initial coordinates 
for the minimization, hydrogens were added at the pH 8.6 used for kinetic measurements. 
This protonated all Lys and Arg residues and the N-terminus and deprotonated all Glu and 
Asp residues and the C-terminal carboxyl group. The protonated form of His 64 was used 
i in all calculations. The model system was solvated with a 5 A layer of water molecules. 
The total number of water molecules in the system was II 43. The overall charge of the 
enzyme-inhibitor complex resulting from this setup was +4 for the WT enzyme. Energy 
simulations were performed with the DISCOVER program. Version 2.9.5 (Biosym 
Technologies, Inc., San Diego, CA) on a Silicon Graphics Indigo computer, using the 
) - consistent valence force field (CVFF) function. A non-bonded cutoff distance of 1 8 A 
with a switching distance of 2 A was employed. The non-bonded pair list was updated 
every 20 cycles and a dielectric constant of 1 was used in all calculations. The WT 
enzyme was minimized in stages, with initially only the water molecules being allowed to 
move, followed by water molecules and the amino acid side chains, and then finally the 
5 entire enzyme. The mutated and chemically modified enzymes were generated by 

modifying the relevant amino acid using the Builder module of Insight. These structures 
were then minimized in a similar manner. Initially, the side-chain of the mutated residue 
and the water molecules were minimized. Then, all side-chains and the water molecules 
were minimized while the backbones of the residues were constrained, then all of the 
0 ^ atoms were minimized. The AAPF inhibitor was free to move throughout all stages of 
^ the minimization. Each stage of energy minimization was conducted by means of the 
^ method of steepest descents without Morse or cross terms until the derivative of energy 
with respect to structural perturbation was less than 5.0 kcal/A; then, the method of 
conjugate gradients, without Morse or cross terms until the derivative of energy with 
i5 respect to structural perturbation was less than 1 .0 kcal/A; and, finally, the method of 
conjugate gradients, with Morse and cross terms until the final derivative of energy with 
respect to structural perturbation was less than 0.1 kcal/A. 

The molecular basis for the vastly improved esterase activities observed 
was analyzed by molecular modeling of the peptidyl product inhibitor AAPF bound to the 
50 SBL-CMM L217C-SpGlc(Ac)3. While the substrate employed for kinetic analysis is 
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succinylated, this moiety was not included in molecular modeling since its orientation 
was not reported in the X-ray structure of SBL, suggesting high mobility in the crystal. 
As shown in Figures 17 and 1 8, molecular modeling analysis of L21 7C- 
SpGlc(Ac)^ revealed that the observed kJK^ changes correlate with the occupation of the 
Sj' pocket of SBL by the glucosylated -SpGlc(Ac); side chain. In the minimized structure 
(Figure 1 7), the position of the glucose moiety was fixed by a network of hydrogen- 
bonding interactions (shown as white doited lines) between the oxygen atoms on the C-3. 
4, and 6 substituents of glucose and water molecules in the surrounding external solvent. 
This extensive solvation directed the C-2 substituent of glucose internally towards the 
catalytic triad. In this orientation, the carbonyl oxygen atom of the C-2 acetate group acts 
a hydrogen bond (1 .89 A) acceptor and stabilizes water molecule 127 in close proximity 
to the carboxy terminus of AAPF, used here as a substrate analog for modeling, and is 
shown in Figure 1 7 hydrogen-bonding to the O atom of the carboxylic acid. 

These results suggested that firstly the low amidase activity of L217C-S- 
Glc(Ac)3, which was 2-fold lower than WT, was a result of the S,' pocket being occupied 
by the glucose moiety at position 217. This prevents efficient binding of the /?NA leaving 
group and, therefore, decreases the rate of acyl-enzyme intermediate formation which is 
the rate-determining step for amidase activity. Secondly, after the /?NA has been 
displaced to form the covalent Acyl-Ser221 intermediate, the glucose moiety stabilizes a 
crucial, nucleophilic water molecule (Wat 127) in close proximity to the carbonyl carbon 
atom, through a hydrogen-bond to the oxygen of the C-2 acetate of glucose, as illustrated 
in Figure 18. This facilitates hydroylsis of the acyl-enzyme intermediate and therefore 
increases the rate of deacylation, which is the rate limiting step for esterase activity 
(Zemer et al., J. Am. Chem. Soc . 86:3674-3679 (1964); Whitaker et al., J. Am. Chem. 
Soc,, 87:2728-2737(1965); Berezin et al., FEBS Lett. . 15:121-124 (1971), which are 
hereby incorporated by reference). 

Glycosylation of SBL at sites within the active site can dramatically 
enhance its esterase activity. The library of glycosylated CMMs synthesized using the 
combined site directed mutagenesis and chemical modification strategy contains 22 
CMMs with greater than WT activity. Glycosylation at positions 62, in the pocket, and 
2 1 7, in the S , ' pocket, gave the greatest increases in k^JK^.^, The most active CMM 
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L217C-SpGlc(Ac), had a kJK„ that was 8.4-fold greater that WT and was the most 
active esterase synthesized using this approach. When surface exposed position 1 56 was 
olvcosvlaied. there was little alteration in activity, and this demonstrates that the 
introduction of sugars at such sites has little effect on the catalytic activity of SBL. 

5 In addition to the tailoring of esterase kJK„ values, glycosylation also led 

to enormous improvements in specificity for ester versus amide hydrolysis, as determined 
by measurement of the ratio {kJK,;)„ I {kJK„U^,,,, E/A. This ratio has been 
increased to 1 7.2-fold greater than WT for L21 7C-S(3Glc(Ac),. Such enzymes are very 
attractive candidates for use in peptide synthesis, where a high esterase to amidase ratio is 

10 desirable. Furthermore, the CMMs described above have an even greater potential for 
this purpose, as the increases in E/As have been achieved by an increase in the catalytic 
efficiency of these enzymes towards ester substrates, in addition to a reduction in the 
amidase activity. 

Although the invention has been described in detail for the purpose of 
1 5 illustration, it is understood that such detail is solely for that purpose, and variations can 
be made therein by those skilled in the art without departing from the spirit and scope of 
the invention which is defined by the following claims. 
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WHAT IS CLAIMED : 

1. A chemically modified mutant protein, said mutant protein 
comprising a cysteine residue substituted for a residue other than cysteine in a precursor 
protein, the substituted cysteine residue being subsequently modified by reacting said 
5 cysteine residue with a glycosylated thiosulfonate. 



2. A chemically modified mutant protein according to claim K 
wherein the protein is an enzyme. 

10 3. A chemically modified mutant protein according to claim 2, 

wherein the enzyme is a protease. 

4. A chemically modified mutant protein according to claim 3, 
wherein the protease is a Bacillus lentus subtilisin. 

15 

5. A chemically modified mutant protein according to claim L 
wherein said thiosulfonate comprises an aikylthiosulfonate. 

6. A chemically modified mutant protein according to claim 5, 
wherein said aikylthiosulfonate comprises methanethiosulfonate. 

20 7. A chemically modified mutant protein according to claim 1, 

wherein the residue other than cysteine is an amino acid selected from the group 
consisting of asparagine, leucine, and serine. 



8. A chemically modified mutant protein according to claim 1, 

25 wherein the residue other than cysteine is in a substrate binding subsite of the protein. 

9. A chemically modified mutant protein according to claim 1, 
wherein the glycosylated thiosulfonate comprises a thiol side chain comprising -S-P-Glc, 
-S-Et-P-Gal, -S-Et-P-Glc, -S-Et-a-Glc, -S-Et-a-Man, -S-Et-Lac, -S-p-GlcCAc),, 

30 -S-p-Glc(Ac)3, -S-p-Glc(Ac)4, -S-Et-a-Glc(Ac),, -S-Et-a-Glc(Ac)3, -S-Et-a-GlcCAc)^, 
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-S-Et-p-Glc(Ac),, -S-Et-p-Glc(Ac)3, -S-Et-p-Glc(Ac),, -S"Et-a-Man(Ac),, 
-S-Et-a-Man(Ac)„ -S-Et-p-Gal(Ac);,, -S-Et-p-Gal(Ac),, "S-Et-Lac(Ac),, -S-Et-Lac(Ac),, 
or -S-Et-Lac(Ac)7. 

10- A chemically modified mutant protein according to claim 1, 
5 wherein the carbohydrate moiety is a dendrimer moiety. 

11. A method of producing a chemically modified mutant protein 
comprising the steps of (a) providing a precursor protein; (b) substituting an amino acid 
residue other than cysteine in said precursor protein with a cysteine; (c) reacting said 
substituted cysteine with a glycosylated thiosulfonate, said glycosylated thiosulfonate 

10 comprising a carbohydrate moiety; and (d) obtaining a modified glycosylated protein 
wherein said substituted cysteine comprises a carbohydrate moiety attached thereto. 

12, A method according to claim 1 1, wherein said thiosulfonate 
comprises an alkylthiosulfonate. 



15 



20 



25 



13. A method according to claim 12, wherein said alkylthiosulfonate 
comprises a methanethiosulfonate. 

14. A method according to claim 1 1 , wherein the protein is an enzyme. 

15. A method according to claim 14, wherein the enzyme is a protease. 

16. A method according to claim 15, wherein the protease is a Bacillus 
lentus subtilisin. 

17. A method according to claim 1 1, wherein the amino acid residue 
other than cysteine is an amino acid selected from the group consisting of asparagine, 
leucine, and serine. 
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1 8. A method according to claim 1 1 , wherein the amino acid residue 
other than cysteine is in a substrate binding subsite of the protein. 

19. A method according to claim 1 1 , wherein the glycosylated 

5 ihiosulfonate comprises a thiol side chain comprising -S-p-Glc, -S-Et-P-GaL -S-Et-P-Glc. 
-S-Et-a-Glc, -S-Ei-a-iVlan. -S-Et-Lac, -S-P-Glc(Ac)., -S-P-Glc(Ac)„ -S-P-Glc(Ac),, 
-S-Et-a-Glc(Ac)., -S-Et-a-Glc(Ac)„ -S-Et-a-Glc(Ac)4, -S-Et-p-Glc(Ac),, 
-S-Et-P-Glc(Ac),, -S-Et-p-Glc(Ac)„ -S-Et-a-ManCAc),, -S-Et-a-Man(Ac),, 
-S-Et-p-Gal(Ac)3, -S-Et-P-Gal(Ac)4, -S-Et-Lac(Ac),, -S-Et-LacCAc)^, or -S-Et-LacCAc);. 

10 

20. A method according to claim 1 1 , wherein the carbohydrate moiety 
is a dendrimer moiety. 

21. A glycosylated thiosulfonate comprising: 

O 
II 

H3C— S— SR 

II 

O 

15 

wherein R comprises -P-Glc, -Et-P-Gal, -Et-p-Glc, -Et-a-Glc, -Et-a-Man, -Et-Lac, 
-p-Glc(Ac)3, -p-Glc(Ac)3, -p-Glc(Ac)4, -Et-a-Glc(Ac)2, -Et-a-Glc(Ac)„ -Et-a-Glc(Ac)„ 
-Et-p-Glc(Ac)2, -Et-P-Glc(Ac)„ -Et-P-Glc(Ac)„ -Et-a-ManCAc),, -Et-a-Man(Ac)„ . 
-Et-p-Gal(Ac)3, -Et-p-Gal(Ac)4, -Et-Lac(Ac)5, -Et-LacCAc)^, or -Et-LacCAc),. 

20 

22. A method of modifying the functional characteristics of a protein 

comprising: 

providing a protein and 

reacting the protein with a glycosylated thiosulfonate reagent under 
25 conditions effective to produce a glycoprotein with altered functional characteristics as 
compared to the protein. 



23. A method according to claim 22, wherein the protein is an enzyme. 
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24. A method according to claim 23. wherein the enzyme is a protease. 

25. A method according to claim 24, wherein the protease is a Bacillus 
leruus subtilisin. 

3 

26. A method according to claim 22, wherein the glycosylated 
thiosulfonate comprises: 

O 

II 

H3C— S— SR 
II 

O 

wherein R comprises -p-Glc, -Et-p-Gal, -Et-p-Glc, -Et-a-Glc, -Et-a-Man, -Et-Lac, 
10 -p-Glc(Ac)„ -p-Glc(Ac)3, -p-Glc(Ac)4, -Et-a-Glc(Ac),, -Et-a-Glc(Ac)., -Et-a-Glc(Ac),, 
-Et-p-Glc(Ac)„ -Et-p-Glc(Ac),, -Et-p-GlcCAc)^, -Et-a-ManCAc),, -Et-a-Man(Ac),, 
-Et-P-Gal(Ac)3, -Et-p-Gal(Ac)4, -Et-Lac(Ac)„ -Et-LacCAc)^, or -Et-LacCAc)^. 

27. A method of determining the structure-function relationships of 
15 chemically modified mutant proteins comprising: 

providing first and second chemically modified mutant proteins according 
to claim 1. wherein the glycosylation pattern of the second chemically modified mutant 
protein is different from the glycosylation pattern of the first chemically modified mutant 
. protein; 

20 evaluating a functional characteristic of the first and second chemically 

modified mutant proteins; and 

correlating the functional characteristic of the first and second chemically 
modified mutant proteins with the structures of the first and second chemically modified 
mutant proteins. 

25 

28. A method according to claim 27, wherein the protein is an enzyme. 

29. A method according to claim 28, wherein the enzyme is a protease. 
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30. A method according to claim 29, wherein the protease is a Bacillus 
lentits subtilisin. 

31. A method according to claim 27. wherein the residue other than 
cysteine is an amino acid selected from the group consisting ofasparagine, leucine, and 
serine. 

32. A method according to claim 27, wherein the residue other than 
cysteine is in a substrate binding subsite of the protein. 

33. A method according to claim 27, wherein the glycosylated 
thiosulfonate comprises a thiol side chain comprising -S-p-Glc, -S-Et-p-Gai, -S-Et-p-Glc, 
-S-Et-a-Glc, -S-Et-a-Man, -S-Et-Lac, -S-p-Glc(Ac)., -S-P-Glc(Ac)„ -S-p-Glc(Ac)„ 
-S-Et-a-Glc(Ac)„ -S-Et-a-Glc(Ac)3, -S-Et-a-Glc(Ac)4, -S-Et-p-GlcCAc)., 
-S-Et-P-Glc(Ac)3, -S-Et-p-Glc(Ac)„ -S-Et-a-ManCAc),, -S-Et-a-Man(Ac)4, 
-S-Et-p-Gal(Ac)3, -S-Et-p-Gal(Ac)„ -S-Et-Lac(Ac)3, .S-Et-Lac(Ac)„ or -S-Et-LacCAc),. 

34. A method according to claim 27, wherein the carbohydrate moiety 
is a dendrimer moiety. 

35. A method of determining the structure- function relationships of 
chemically modified mutant proteins comprising; 

providing first and second chemically modified mutant proteins according 
to claim 1, wherein at least one different cysteine residue in the second chemically 
modified mutant enzyme is modified by reacting said cysteine residue with a glycosylated 
thiosulfonate; 

evaluating a functional characteristic of the first and second chemically 
modified mutant proteins; and 

correlating the functional characteristic of the first and second chemically 
modified mutant proteins with the structures of the first and second chemically modified 
mutant proteins. 
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36. A method accordine to claim 35. wherein the protein is an enzyme. 



37. A method according to claim 36. wherein the enzyme is a protease. 

5 38. A method according to claim 37, wherein the protease is a Bacillus 

lentus subtilisin. 

39. A method according to claim 35, wherein the residue other than 
cysteine is an amino acid selected from the group consisting of asparagine, leucine, and 

10 serine. 

40. A method according to claim 35, wherein the residue other than 
cysteine is in a substrate binding subsite of the protein. 

15 41 . A method according to claim 35, wherein the glycosylated 

thiosulfonate comprises a thiol side chain comprising -S-p-Glc, -S-Et-(3-Gal, -S-Et-p-Glc. 
-S-Et-a-Glc, -S-Et-a-Man, -S-Et-Lac, -S-p-Glc(Ac)., -S-p-Glc(Ac),, -S-P-Glc(Ac)„ 
-S-Et-a-Glc(Ac),, -S-Et-a-GlcCAc),, -S-Et-a-Glc(Ac)„ -S-Et-p-Glc(Ac)„ 
-S-Et-p-Glc(Ac)„ -S-Et-p-Glc(Ac)„ -S-Et-a-Man(Ac),, -S-Et-a-Man(Ac),, 

20 -S-Et-P-GaI(Ac)„ -S-Et-P-Gal(Ac)„ -S-Et-LacCAc),,, -S-Et-Lac(Ac)e, or -S-Et-Lac(Ac),. 

42. A method according to claim 35, wherein the carbohydrate moiety 
is a dendrimer moiety. 
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FIGURE 13 
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FIGURE 14 
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FIGURE 15 
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FIGURE 16 
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